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(57) Abstract 

Compounds and 
methods for the treatment 
and diagnosis of breast 
cancer are provided. The 
inventive compounds include 
polypeptides containing at 
least a portion of a breast 
tumor protein. Vaccines and 
pharmaceutical compositions 
for immunotherapy of 
breast cancer comprising 
such polypeptides, or 
polynucleotide molecules 
encoding such polypeptides, 
are also provided, together 
with polynucleotide 
molecules for preparing the 
inventive polypeptides. 
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COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS 
OF BREAST CANCER AND METHODS FOR THEIR USE 

TECHNICAL FIELD 

The present invention relates generally to compositions and methods for the 
treatment and diagnosis of breast cancer. The invention is more particularly related to 
polypeptides comprising at least a portion of a protein that is preferentially expressed in 
breast tumor tissue and to polynucleotide molecules encoding such polypeptides. Such 
polypeptides may be used in vaccines and pharmaceutical compositions for treatment of 
breast cancer. Additionally such polypeptides and polynucleotides may be used in the 
immunodiagnosis of breast cancer. 

BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United States 
and throughout the world. Although advances have been made in detection and treatment of 
the disease, breast cancer remains the second leading cause of cancer-related deaths in 
women, affecting more than 180,000 women in the United States each year. For women in 
North America, the life-time odds of getting breast cancer are now one in eight. 

No vaccine or other universally successful method for the prevention or 
treatment of breast cancer is currently available. Management of the disease currently relies 
on a combination of early diagnosis (through routine breast screening procedures) and 
aggressive treatment, which may include one or more of a variety of treatments such as 
surgery, radiotherapy, chemotherapy and hormone therapy. The course of treatment for a 
particular breast cancer is often selected based on a variety of prognostic parameters, 
including an analysis of specific tumor markers. See, e.g.. Porter- Jordan and Lippman, 
Breast Cancer 5:73-100 (1994). However, the use of established markers often leads to a 
result that is difficult to interpret, and the high mortality observed in breast cancer patients 
indicates that improvements are needed in the treatment, diagnosis and prevention of the 
disease. 



wo 99/33869 



2 



PCT/US98/27416 



Accordingly, there is a need in the art for improved methods for therapy and 
diagnosis of breast cancer. The present invention fulfills these needs and further provides 
other related advantages. 

SUMMARY OF THE INVENTION 

The present invention provides compounds and methods for immunotherapy 
of breast cancer. In one aspect, isolated polypeptides are provided comprising at least an 
immunogenic portion of a breast tumor protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, vi^herein the breast tumor protein comprises 
an amino acid sequence encoded by a polynucleotide molecule having a partial sequence 
selected from the group consisting of (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 
17, 24, 45-52 and 55-67, 72, 73, and 89-94, (b) complements of said nucleotide sequences 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

In related aspects, isolated polynucleotide molecules encoding the above 
polypeptides are provided. In specific embodiments, such polynucleotide molecules have 
partial sequences provided in SEQ ID NOS: 3, 10, 17, 24, 45-52 and 55-67, 72, 73, and 89- 
94. The present invention further provides expression vectors comprising the above 
polynucleotide molecules and host cells transformed or transfected v^th such expression 
vectors. In preferred embodiments, the host cells are selected fi:om the group consisting of E. 
coli, yeast and mammalian cells. 

In another aspect, the present invention provides fusion proteins comprising a 
first and a second inventive polypeptide or, alternatively, an inventive polypeptide and a 
known breast antigen. 

The present invention also provides pharmaceutical compositions comprising 
at least one of the above polypeptides, or a polynucleotide molecule encoding such a 
polypeptide, and a physiologically acceptable carrier, together with vaccines comprising at 
least one or more such polypeptide or polynucleotide molecule in combination with a non- 
specific immune response enhancer. Pharmaceutical compositions and vaccines comprising 
one or more of the above fusion proteins are also provided. 
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In related aspects, pharmaceutical compositions for the treatment of breast 
cancer comprising at least one polypeptides and a physiologically acceptable carrier are 
provided, wherein the polypeptide comprises an immunogenic portion of a breast tumor 
protein or a variant thereof, the breast tumor protein being encoded by a polynucleotide 
molecule having a partial sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
(b) complements of said nucleotide sequences, and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. The invention also provides vaccines for 
the treatment of breast cancer comprising such polypeptides in combination with a non- 
specific immune response enhancer, together with pharmaceutical compositions and vaccines 
comprising at least one polynucleotide molecule having a partial sequence provided in SEQ 
ID NOS: 1, 2, 4-9, 1 1-16, 18-23, 25-44, 53, 54, 68-71, and 74-88. 

In yet another aspect, methods are provided for inhibiting the development of 
breast cancer in a patient, comprising administering an effective amount of at least one of the 
above pharmaceutical compositions and/or vaccines. 

The present invention also provides methods for immunodiagnosis of breast 
cancer, together with kits for use in such methods. In one specific aspect of the present 
invention, methods are provided for detecting breast cancer in a patient, comprising: (a) 
contacting a biological sample obtained from a patient with a binding agent that is capable of 
binding to one of the inventive polypeptides; and (b) detecting in the sample a protein or 
polypeptide that binds to the binding agent. In preferred embodiments, the binding agent is 
an antibody, most preferably a monoclonal antibody. 

In related aspects, methods are provided for monitoring the progression of 
breast cancer in a patient, comprising: (a) contacting a biological sample obtained from a 
patient with a binding agent that is capable of binding to one of the above polypeptides; (b) 
determining in the sample an amount of a protein or polypeptide that binds to the binding 
agent; (c) repeatmg steps (a) and (b); and comparing the amounts of polypeptide detected in 
steps (b) and (c). 

Within related aspects, the present invention provides antibodies, preferably 
monoclonal antibodies, that bind to the inventive polypeptides, as well as diagnostic kits 
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comprising such antibodies, and methods of using such antibodies to inhibit the development 
of breast cancer. 

The present invention further provides methods for detecting breast cancer 
comprising: (a) obtaining a biological sample from a patient; (b) contacting the sample with 
a first and a second oligonucleotide primer in a polymerase chain reaction, at least one of the 
oligonucleotide primers being specific for a DNA molecule that encodes one of the above 
polypeptides; and (c) detecting in the sample a DNA sequence that amplifies in the presence 
of the first and second oligonucleotide primers. In a preferred embodiment, at least one of the 
oligonucleotide primers comprises at least about 10 contiguous nucleotides of a DNA 
molecule having a partial sequence selected from the group consisting of SEQ ID NOS: 1 -94. 

In a further aspect, the present invention provides a method for detecting 
breast cancer in a patient comprising: (a) obtaining a biological sample from the patient; (b) 
contacting the sample with an oligonucleotide probe specific for a polynucleotide molecule 
that encodes one of the above polypeptides; and (c) detecting in the sample a polynucleotide 
sequence that hybridizes to the oligonucleotide probe. Preferably, the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a DNA molecule having a partial 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

In related aspects, diagnostic kits comprising the above oligonucleotide probes 
or primers are provided. 

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description. All references disclosed herein are hereby 
incorporated by reference in their entirety as if each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. lA and B show the specific lytic activity of a first and a second B51 IS-specific 
CTL clone, respectively, measured on autologous LCL transduced with B511s (filled 
squares) or HLA-A3 (open squares). 
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DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to compositions 
and methods for the immunotherapy and diagnosis of breast cancer. The inventive 
compositions are generally isolated polypeptides that comprise at least a portion of a breast 
tumor protein. Also included within the present invention are molecules (such as an antibody 
or fragment thereof) that bind to the inventive polypeptides. Such molecules are referred to 
herein as "binding agents." 

In particular, the subject invention discloses polypeptides comprising at least a 
portion of a human breast tumor protein, or a variant thereof, wherein the breast tumor protein 
includes an amino acid sequence encoded by a polynucleotide molecule including a 
sequence selected from the group consisting of: nucleotide sequences recited in SEQ ID 
NOS: 1- 94, the complements of said nucleotide sequences, and variants thereof As used 
herein, the term "polypeptide" encompasses amino acid chains of any length, including full 
length proteins, wherein the amino acid residues are linked by covalent peptide bonds. Thus, 
a polypeptide comprising a portion of one of the above breast proteins may consist entirely of 
the portion, or the portion may be present within a larger polypeptide that contains additional 
sequences. The additional sequences may be derived from the native protein or may be 
heterologous, and such sequences may be immunoreactive and/or antigenic. 

As used herein, an "immunogenic portion" of a human breast tumor protein is 
a portion that is capable of eliciting an immune response in a patient inflicted with breast 
cancer and as such binds to antibodies present within sera from a breast cancer patient. Such 
immunogenic portions generally comprise at least about 5 amino acid residues, more 
preferably at least about 10, and most preferably at least about 20 amino acid residues. 
Immunogenic portions of the proteins described herein may be identified in antibody binding 
assays. Such assays may generally be performed using any of a variety of means known to 
those of ordinary skill in the art, as described, for example, in Harlow and Lane, Antibodies: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1988. For 
example, a polypeptide may be inunobilized on a solid support (as described below) and 
contacted with patient sera to allow binding of antibodies within the sera to the immobilized 
polypeptide. Unbound sera may then be removed and boimd antibodies detected using, for 
example, *^^I-labeled Protein A, Alternatively, a polypeptide may be used to generate 
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monoclonal and polyclonal antibodies for use in detection of the polypeptide in blood or 
other fluids of breast cancer patients. Methods for preparing and identifying immunogenic 
portions of antigens of known sequence are well known in the art and include those 
summarized in Paul, Fundamental Immunology, 3'"* ed., Raven Press, 1993, pp. 243-247. 

The term "polynucleotide(s)," as used herein, means a single or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide bases and includes DNA and 
corresponding RNA molecules, including HnRNA and mRNA molecules, both sense and 
anti-sense strands, and comprehends cDNA, genomic DNA and recombinant DNA, as well as 
wholly or partially synthesized polynucleotides. An HnRNA molecule contains introns and 
corresponds to a DNA molecule in a generally one-to-one maimer. An mRNA molecule 
corresponds to an HnRNA and DNA molecule from which the introns have been excised. A 
polynucleotide may consist of an entire gene, or any portion thereof Operable anti-sense 
polynucleotides may comprise a fragment of the corresponding polynucleotide, and the 
definition of "polynucleotide" therefore includes all such operable anti-sense fragments. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and polynucleotides. A polypeptide "variant," as used 
herein, is a polypeptide that differs from the recited polypeptide only in conservative 
substitutions and/or modifications, such that the therapeutic, antigenic and/or immxmogenic 
properties of the polypeptide are retained. In a preferred embodiment, variant polypeptides 
differ from an identified sequence by substitution, deletion or addition of five amino acids or 
fewer. Such variants may generally be identified by modifying one of the above polypeptide 
sequences, and evaluating the antigenic properties of the modified polypeptide using, for 
example, the representative procedures described herein. Polypeptide variants preferably 
exhibit at least about 70%, more preferably at least about 90% and most preferably at least 
about 95% identity (determined as described below) to the identified polypeptides. 

As used herein, a "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one skilled in the art 
of peptide chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. In general, the following groups of amino acids 
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represent conservative changes: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, 
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. 

Variants may also, or alternatively, contain other modifications, including the 
deletion or addition of amino acids that have minimal influence on the antigenic properties, 
secondary structure and hydropathic nature of the polypeptide. For example, a polypeptide 
may be conjugated to a signal (or leader) sequence at the N-terminal end of the protein which 
co-translationally or post-translationally directs transfer of the protein. The polypeptide may 
also be conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a 
solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc 
region. 

A nucleotide "variant" is a sequence that differs from the recited nucleotide 
sequence in having one or more nucleotide deletions, substitutions or additions. Such 
modifications may be readily introduced using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al. 
(DNA, 2:183, 1983). Nucleotide variants may be naturally occurring allelic variants, or non- 
naturally occurring variants. Variant nucleotide sequences preferably exhibit at least about 
70%, more preferably at least about 80% and most preferably at least about 90% identity 
(determined as described below) to the recited sequence. 

The antigens provided by the present invention include variants that are 
encoded by DNA sequences which are substantially homologous to one or more of the DNA 
sequences specifically recited herein. "Substantial homology," as used herein, refers to DNA 
sequences that are capable of hybridizing under moderately stringent conditions. Suitable 
moderately stringent conditions include prewashing in a solution of 5X SSC, 0.5% SDS, 
1.0 mM EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC, overnight or, in the event of 
cross-species homology, at 45°C with 0.5X SSC; followed by washing twice at eS^'C for 20 
minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. Such hybridizing DNA 
sequences are also within the scope of this invention, as are nucleotide sequences that, due to 
code degeneracy, encode an immunogenic polypeptide that is encoded by a hybridizing DNA 
sequence. 
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Two nucleotide or polypeptide sequences are said to be "identical" if the 
sequence of nucleotides or amino acid residues in the two sequences is the same when aligned 
for maximum correspondence as described below. Comparisons between two sequences are 
typically performed by comparing the sequences over a comparison window to identify and 
compare local regions of sequence similarity. A "comparison window" as used herein, refers 
to a segment of at least about 20 contiguous positions, usually 30 to about 75, more 
preferably 40 to about 50, in which a sequence may be compared to a reference sequence of 
the same number of contiguous positions after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 
Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., 
Madison, WI), using default parameters. This program embodies several alignment schemes 
described in the following references; Dayhoff, M.O. (1978) A model of evolutionary change 
in proteins - Matrices for detecting distant relationships. In Dayhoff, M.O. (ed.) Atlas of 
Protein Sequence and Structure, National Biomedical Resarch Foundaiton, Washington DC 
Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes 
pp. 626-645 Methods in Enzymology vol 183, Academic Press, Inc., San Diego, CA; 
Higgins, D.G. and Sharp, P.M. (1989) Fast and sensitive multiple sequence alignments on a 
microcomputer 045/05 5:151-153; Myers, E.W. and Muller W. (1988) Optimal alignments 
in linear space CABIOS ^:11-17; Robinson, E.D. (1971) Comb, Theor 77:105; Santou, N. 
Nes, M. (1987) The neighbor joining method. A new method for reconstructing phylogenetic 
trees Mol Biol Evol 4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical 
Taxonomy - the Principles and Practice of Numerical Taxonomy, Freeman Press, San 
Francisco, CA; Wilbur, W.J. and Lipman, DJ. (1983) Rapid similarity searches of nucleic 
acid and protein data banks Proc. Natl Acad, Scl USA 50:726-730. 

Preferably, the "percentage of sequence identity" is determined by comparing 
two optimally aligned sequences over a window of comparison of at least 20 positions, 
wherein the portion of the polynucleotide sequence in the comparison v^dndow may comprise 
additions or deletions (i.e. gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 
percent, as compared to the reference sequences (which does not comprise additions or 
deletions) for optimal alignment of the two sequences. The percentage is calculated by 
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determining the number of positions at which the identical nucleic acid bases or amino acid 
residue occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the reference sequence (i.e. 
the window size) and multiplying the results by 100 to yield the percentage of sequence 
identity. 

Also included in the scope of the present invention are alleles of the genes encoding 
the nucleotide sequences recited herein. As used herein, an "allele" or ''allellic sequence" is 
an alternative form of the gene which may result from at least one mutation in the nucleic 
acid sequence. Alleles may result in altered mRNAs or polypeptides whose structure or 
function may or may not be altered. Any given gene may have none, one, or many allelic 
forms. Common mutational changes which give rise to alleles are generally ascribed to 
natural deletions, additions, or substitutions of nucleotides. Each of these types of changes 
may occur alone or in combination with the others, one or more times in a given sequence. 

For breast tumor polypeptides with immunoreactive properties, variants may, 
alternatively, be identified by modifying the amino acid sequence of one of the above 
polypeptides, and evaluating the immunoreactivity of the modified polypeptide. For breast 
tumor polypeptides useful for the generation of diagnostic binding agents, a variant may be 
identified by evaluating a modified polypeptide for the ability to generate antibodies that 
detect the presence or absence of breast cancer. Such modified sequences may be prepared 
and tested using, for example, the representative procedures described herein. 

The breast tumor proteins of the present invention, and polynucleotide 
molecules encoding such proteins, may be isolated from breast tumor tissue using any of a 
variety of methods well known in the art. Polynucleotide sequences corresponding to a gene 
(or a portion thereof) encoding one of the inventive breast tumor proteins may be isolated 
from a breast tumor cDNA library using a subtraction technique as described in detail below. 
Examples of such DNA sequences are provided in SEQ ID NOS: 1- 94. Partial 
polynucleotide sequences thus obtained may be used to design oligonucleotide primers for 
the amplification of full-length polynucleotide sequences in a polymerase chain reaction 
(PCR), using techniques well known in the art (see, for example, MuUis et al., Cold Spring 
Harbor Symp. Quant. Biol, 57:263, 1987; Erlich ed., PCR Technology, Stockton Press, NY, 
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1989). Once a polynucleotide sequence encoding a polypeptide is obtained, any of the above 
modifications may be readily introduced using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al. 
{DNA, 2:183, 1983). 

The breast tumor polypeptides disclosed herein may also be generated by 
synthetic or recombinant means. Synthetic polypeptides having fewer than about 100 amino 
acids, and generally fewer than about 50 amino acids, may be generated using techniques 
well known to those of ordinary skill in the art. For example, such polypeptides may be 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain (see, for example, Merrifield, J. Am. Chem. Soc, S5:2 149-2 146, 
1963). Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/ Applied BioSystems Division (Foster City, CA), and may be 
operated according to the manufacturer's instructions. 

Alternatively, any of the above polypeptides may be produced recombinantly 
by inserting a polynucleotide sequence that encodes the polypeptide into an expression 
vector and expressing the protein in an appropriate host. Any of a variety of expression 
vectors known to those of ordinary skill in the art may be employed to express recombinant 
polypeptides of this invention. Expression may be achieved in any appropriate host cell that 
has been transformed or transfected with an expression vector containing a polynucleotide 
molecule that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, 
yeast and higher eukaryotic cells. Preferably, the host cells employed are E. coliy yeast or a 
manunalian cell line, such as CHO cells. The polynucleotide sequences expressed in this 
manner may encode naturally occurring polypeptides, portions of naturally occurring 
polypeptides, or other variants thereof. 

In general, regardless of the method of preparation, the polypeptides disclosed 
herein are prepared in an isolated, substantially pure form {i.e., the polypeptides are 
homogenous as determined by amino acid composition and primary sequence analysis). 
Preferably, the polypeptides are at least about 90% pure, more preferably at least about 95% 
pure and most preferably at least about 99% pure. In certain preferred embodiments, 
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described in more detail below, the substantially pure polypeptides are incorporated into 
pharmaceutical compositions or vaccines for use in one or more of the methods disclosed 
herein. 

In a related aspect, the present invention provides fusion proteins comprising a 
first and a second inventive polypeptide or, alternatively, a polypeptide of the present 
invention and a known breast tumor antigen, together with variants of such fusion proteins. 

A polynucleotide sequence encoding a fusion protein of the present invention 
is constructed using known recombinant DNA techniques to assemble separate 
polynucleotide sequences encoding the first and second polypeptides into an appropriate 
expression vector. The 3' end of a polynucleotide sequence encoding the first polypeptide is 
ligated, with or without a peptide linker, to the 5' end of a polynucleotide sequence encoding 
the second polypeptide so that the reading frames of the sequences are in phase to permit 
mRNA translation of the two DNA sequences into a single fusion protein that retains the 
biological activity of both the first and the second polypeptides. 

A peptide linker sequence may be employed to separate the first and the 
second polypeptides by a distance sufficient to ensure that each polypeptide folds into its 
secondary and tertiary structures. Such a peptide linker sequence is incorporated into the 
fusion protein using standard techniques well known in the art. Suitable peptide linker 
sequences may be chosen based on the following factors: (1) their ability to adopt a flexible 
extended conformation; (2) their inability to adopt a secondary structure that could interact 
with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic 
or charged residues that might react with the polypeptide functional epitopes. Preferred 
peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which 
may be usefiilly employed as linkers include those disclosed in Maratea et al.. Gene -^0:39-46, 
1985; Murphy etal., Proc, Natl Acad Sci. USA 55:8258-8262, 1986; U.S. Patent 
No, 4,935,233 and U.S. Patent No. 4,751,180. The linker sequence may be from 1 to about 
50 amino acids in length. Peptide sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to separate 
the fimctional domains and prevent steric interference. 
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The ligated polynucleotide sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements responsible for 
expression of polynucleotides are located only 5' to the polynucleotide sequence encoding 
the first polypeptides. Similarly, stop codons require to end translation and transcription 
termination signals are only present 3' to the polynucleotide sequence encoding the second 
polypeptide. 

Fusion proteins are also provided that comprise a polypeptide of the present 
invention together with an unrelated immunogenic protein. Preferabh' the immunogenic 
protein is capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl J. Med., 55(5:86- 
91 (1997)). 

Polypeptides of the present invention that comprise an immunogenic portion 
of a breast tumor protein may generally be used for immunotherapy of breast cancer, wherein 
the polypeptide stimulates the patient's own immune response to breast tumor cells. In 
further aspects, the present invention provides methods for using one or more of the 
immunoreactive polypeptides encoded by a polynucleotide molecule having a sequence 
provided in SEQ ID NOS: 1- 94 (or fiision proteins comprising one or more such 
polypeptides and/or polynucleotides encoding such polypeptides) for immunotherapy of 
breast cancer in a patient. As used herein, a "patient" refers to any warm-blooded animal, 
preferably a human. A patient may be afflicted with a disease, or may be fi-ee of detectable 
disease. Accordingly, the above immunoreactive polypeptides (or fusion proteins or 
polynucleotide molecules encoding such polypeptides) may be used to treat breast cancer or 
to inhibit the development of breast cancer. The polypeptides may be administered either 
prior to or following surgical removal of primary tumors and/or treatment by administration 
of radiotherapy and conventional chemotherapeutic drugs. 

In these aspects, the polypeptide or fusion protein is generally present within a 
pharmaceutical composition and/or a vaccine. Pharmaceutical compositions may comprise 
one or more polypeptides, each of which may contain one or more of the above sequences (or 
variants thereof), and a physiologically acceptable carrier. The vaccines may comprise one or 
more of such polypeptides and a non-specific immime response enhancer, wherein the non- 
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specific immune response enhancer is capable of eliciting or enhancing an immune response 
to an exogenous antigen. Examples of non-specific-immime response enhancers include 
adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which 
the polypeptide is incorporated). Pharmaceutical compositions and vaccines may also contain 
other epitopes of breast tumor antigens, either incorporated into a combination polypeptide 
(i.e., a single polypeptide that contains multiple epitopes) or present within a separate 
polypeptide. 

Alternatively, a pharmaceutical composition or vaccine may contain 
polynucleotides encoding one or more of the above polypeptides, such that the polypeptide is 
generated in situ. In such pharmaceutical compositions and vaccines, the polynucleotide 
may be present within any of a variety of delivery systems known to those of ordinary skill in 
the art, including nucleic acid expression systems, bacteria and viral expression systems. 
Appropriate nucleic acid expression systems contain the necessary polynucleotide sequences 
for expression in the patient (such as a suitable promoter). Bacterial delivery systems involve 
the administration of a bacterium (such as BaciUus-Calmette-Guerrin) that expresses an 
epitope of a breast tumor cell antigen on its cell surface. In a preferred embodiment, the 
polynucleotide molecules may be introduced using a viral expression system (e.g., vaccinia 
or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic 
(defective), replication competent virus. Suitable systems are disclosed, for example, in 
Fisher-Hoch et al., PNAS 8^:317-321, 1989; Flexner et al., Ann, N.Y. Acad ScL 5(5P:86-103, 
1989; Flexner et al.. Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 
5,017,487; WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al.. Science 
252:431-434, 1991; Kolls et al., PNAS P7:215-219, 1994; Kass-Eisler et al., PNAS 
90:11498-11502, 1993; Guzman et al., Circulation 55:2838-2848, 1993; and Guzman et al., 
C/r. Res. 75:1202-1207, 1993. Techniques for incorporating polynucleotides into such 
expression systems are well known to those of ordinary skill in the art. The polynucleotides 
may also be "naked," as described, for example, in published PCT application WO 90/1 1092, 
and Ulmer et al., Science 2J9: 1745- 1749, 1993, reviewed by Cohen, Science 2JP: 1691 -1692, 
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1993. The uptake of naked polynucleotides may be increased by coating the polynucleotides 
onto biodegradable beads, which are efficiently transported into the cells. 

Routes and frequency of administration, as well as dosage, will vary from 
individual to individual and may parallel those currently being used in immunotherapy of 
other diseases. In general, the pharmaceutical compositions and vaccines may be 
administered by injection {e.g., intracutaneous, intramuscular, intravenous or subcutaneous), 
intranasally {e.g., by aspiration) or orally. Between 1 and 10 doses may be administered over 
a 3-24 week period. Preferably, 4 doses are administered, at an interval of 3 months, and 
booster administrations may be given periodically thereafter. Alternate protocols may be 
appropriate for individual patients. A suitable dose is an amount of polypeptide or 
polynucleotide molecule that is effective to raise an immune response (cellular and/or 
humoral) against breast tumor cells in a treated patient. A suitable immune response is at 
least 10-50% above the basal {i.e., untreated) level. In general, the amount of polypeptide 
present in a dose (or produced in situ by the polynucleotide in a dose) ranges from about 1 pg 
to about 100 mg per kg of host, typically from about 10 pg to about 1 mg, and preferably 
from about 100 pg to about 1 ixg. Suitable dose sizes will vary with the size of the patient, 
but will typically range from about 0.01 mL to about 5 mL. 

While any suitable carrier known to those of ordinary skill in the art may be 
employed in the pharmaceutical compositions of this invention, the type of carrier will vary 
depending on the mode of administration. For parenteral administration, such as 
subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a lipid, a wax 
and/or a buffer. For oral administration, any of the above carriers or a solid carrier, such as 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, 
sucrose, and/or magnesium carbonate, may be employed. Biodegradable microspheres (e.g. , 
polylactic glycolide) may also be employed as carriers for the pharmaceutical compositions 
of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268 and 5,075,109. 

Any of a variety of non-specific immune response enhancers may be employed 
in the vaccines of this invention. For example, an adjuvant may be included. Most adjuvants 
contain a substance designed to protect the antigen from rapid catabolism, such as aluminum 
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hydroxide or mineral oil, and a nonspecific stimulator of immune response, such as lipid A, 
Bordella pertussis or Mycobacterium tuberculosis. Such adjuvants are commercially 
available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco 
Laboratories, Detroit, MI) and Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ). 

Polypeptides disclosed herein may also be employed in adoptive 
immunotherapy for the treatment of cancer. Adoptive immunotherapy may be broadly 
classified into either active or passive immunotherapy. In active immunotherapy, treatment 
relies on the in vivo stimulation of the endogenous host immime system to react against 
tumors with the administration of immune response-modifying agents (for example, tumor 
vaccines, bacterial adjuvants, and/or cytokines). 

In passive immunotherapy, treatment involves the delivery of biologic 
reagents with established tumor-immune reactivity (such as effector cells or antibodies) that 
can directly or indirectly mediate antitumor effects and does not necessarily depend on an 
intact host immune system. Examples of effector cells include T lymphocytes (for example, 
CD8-1- cytotoxic T-lymphocyte, CD4+ T-helper, tumor-infiltrating lymphocytes), killer cells 
(such as Natural Killer cells, lymphokine-activated killer cells), B cells, or antigen presenting 
cells (such as dendritic cells and macrophages) expressing the disclosed antigens. The 
polypeptides disclosed herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as in U.S. Patent No. 4,918,164), for passive immunotherapy. 

The predominant method of procuring adequate numbers of T-cells for 
adoptive immunotherapy is to grow immune T-cells in vitro. Culture conditions for 
expanding single antigen-specific T-cells to several billion in number with retention of 
antigen recognition in vivo are well known in the art. These in vitro culture conditions 
typically utilize intermittent stimulation with antigen, often in the presence of cytokines, such 
as IL-2, and non-dividing feeder cells. As noted above, the immunoreactive polypeptides 
described herein may be used to rapidly expand antigen-specific T cell cultures in order to 
generate sufficient number of cells for immunotherapy. In particular, antigen-presenting 
cells, such as dendritic, macrophage or B-cells, may be pulsed with inmiunoreactive 
polypeptides or transfected with a polynucleotide sequence(s), using standard techniques well 
known in the art. For example, antigen presenting cells may be transfected wdth a 
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polynucleotide sequence, wherein said sequence contains a promoter region appropriate for 
increasing expression, and can be expressed as part of a recombinant virus or other expression 
system. For cultured T-celis to be effective in therapy, the cultured T-cells must be able to 
grow and distribute widely and to survive long term in vivo. Studies have demonstrated that 
cultured T-cells can be induced to grow in vivo and to survive long term in substantial 
numbers by repeated stimulation with antigen supplemented v^th IL-2 (see, for example, 
Cheever, M., et al, "Therapy With Cultured T Cells: Principles Revisited " Immunological 
Reviews, 757:177, 1997). 

The polypeptides disclosed herein may also be employed to generate and/or 
isolate tumor-reactive T-cells, which can then be administered to the patient. In one 
technique, antigen-specific T-cell lines may be generated by in vivo immunization with short 
peptides corresponding to immunogenic portions of the disclosed polypeptides. The resulting 
antigen specific CD8+ CTL clones may be isolated from the patient, expanded using standard 
tissue culture techniques, and returned to the patient. 

Alternatively, peptides corresponding to immunogenic portions of the 
polypeptides may be employed to generate tumor reactive T cell subsets by selective in vitro 
stimulation and expansion of autologous T cells to provide antigen-specific T cells which 
may be subsequently transferred to the patient as described, for example, by Chang et al. 
(Crit Rev. Oncol HematoL 22(3), 213, 1996), Cells of the immune system, such as T cells, 
may be isolated from the peripheral blood of a patient, using a commercially available cell 
separation system, such as CellPro Incorporated's (Bothell, WA) CEPRATE™ system (see 
U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 91/161 16 and WO 
92/07243). The separated cells are stimulated with one or more of the immunoreactive 
polypeptides contained v^thin a delivery vehicle, such as a microsphere, to provide antigen- 
specific T cells. The population of tumor antigen-specific T cells is then expanded using 
standard techniques and the cells are administered back to the patient. 

In another embodiment, T-cell and/or antibody receptors specific for the 
polypeptides can be cloned, expanded, and transferred into other vectors or effector cells for 
use in adoptive immunotherapy. 

In a further embodiment, syngeneic or autologous dendritic cells may be 
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pulsed with peptides corresponding to at least an immunogenic portion of a polypeptide 
disclosed herein. The resulting antigen-specific dendritic cells may either be transferred into 
a patient, or employed to stimulate T cells to provide antigen-specific T cells which may, in 
turn, be administered to a patient. The use of peptide-pulsed dendritic cells to generate 
antigen-specific T cells and the subsequent use of such antigen-specific T cells to eradicate 
tumors in a murine model has been demonstrated by Cheever et al, Immunological Reviews, 
757:177, 1997). 

Additionally, vectors expressing the disclosed polynucleotides may be introduced into 
stem cells taken from the patient and clonally propagated in vitro for autologous transplant 
back into the same patient. 

Polypeptides of the present invention may also, or alternatively, be used to 
generate binding agents, such as antibodies or fi:agments thereof, that are capable of detecting 
metastatic human breast tumors. Binding agents of the present invention may generally be 
prepared using methods known to those of ordinary skill in the art, including the 
representative procedures described herein. Binding agents are capable of differentiating 
between patients with and without breast cancer, using the representative assays described 
herein. In other words, antibodies or other binding agents raised against a breast tumor 
protein, or a suitable portion thereof, will generate a signal indicating the presence of primary 
or metastatic breast cancer in at least about 20% of patients afflicted with the disease, and will 
generate a negative signal indicating the absence of the disease in at least about 90% of 
individuals without primary or metastatic breast cancer. Suitable portions of such breast 
tumor proteins are portions that are able to generate a binding agent that indicates the 
presence of primary or metastatic breast cancer in substantially all {i.e., at least about 80%, 
and preferably at least about 90%) of the patients for which breast cancer would be indicated 
using the full length protein, and that indicate the absence of breast cancer in substantially all 
of those samples that would be negative when tested with full length protein. The 
representative assays described below, such as the two-antibody sandwich assay, may 
generally be employed for evaluating the ability of a binding agent to detect metastatic human 
breast tumors. 



wo 99/33869 



18 



PCT/US98/27416 



The ability of a polypeptide prepared as described herein to generate 
antibodies capable of detecting primary or metastatic human breast tumors may generally be 
evaluated by raising one or more antibodies against the polypeptide (using, for example, a 
representative method described herein) and determining the ability of such antibodies to 
detect such tumors in patients. This determination may be made by assaying biological 
samples from patients with and v^thout primary or metastatic breast cancer for the presence 
of a polypeptide that binds to the generated antibodies. Such test assays may be performed, 
for example, using a representative procedure described below. Polypeptides that generate 
antibodies capable of detecting at least 20% of primary or metastatic breast tumors by such 
procedures are considered to be useful in assays for detecting primary or metastatic htmian 
breast tumors. Polypeptide specific antibodies may be used alone or in combination to 
improve sensitivity. 

Polypeptides capable of detecting primary or metastatic human breast tumors 
may be used as markers for diagnosing breast cancer or for monitoring disease progression in 
patients. In one embodiment, breast cancer in a patient may be diagnosed by evaluating a 
biological sample obtained from the patient for the level of one or more of the above 
polypeptides, relative to a predetermined cut-off value. As used herein, suitable "biological 
samples" include blood, sera and urine. 

The level of one or more of the above polypeptides may be evaluated using 
any binding agent specific for the polypeptide(s). A "binding agent," in the context of this 
invention, is any agent (such as a compound or a cell) that binds to a polypeptide as described 
above. As used herein, "binding" refers to a noncovalent association between two separate 
molecules (each of which may be fi-ee (/.e., in solution) or present on the stirface of a cell or a 
solid support), such that a "complex" is formed. Such a complex may be free or immobilized 
(either covalently or noncovalently) on a support material. The ability to bind may generally 
be evaluated by determining a binding constant for the formation of the complex. The 
binding constant is the value obtained when the concentration of the complex is divided by 
the product of the component concentrations. In general, two compounds are said to "bind" 
in the context of the present invention when the binding constant for complex formation 
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exceeds about lO^L/mol. The binding constant may be determined using methods well 
known to those of ordinary skill in the art. 

Any agent that satisfies the above requirements may be a binding agent. For 
example, a binding agent may be a ribosome with or without a peptide component, an RNA 
molecule or a peptide. In a preferred embodiment, the binding partner is an antibody, or a 
fragment thereof. Such antibodies may be polyclonal, or monoclonal. In addition, the 
antibodies may be single chain, chimeric, CDR-grafted or humanized. Antibodies may be 
prepared by the methods described herein and by other methods well known to those of skill 
in the art. 

There are a variety of assay formats known to those of ordinary skill in the art 
for using a binding partner to detect polypeptide markers in a sample. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In a 
preferred embodiment, the assay involves the use of binding partner immobilized on a solid 
support to bind to and remove the polypeptide from the remainder of the sample. The bound 
polypeptide may then be detected using a second binding partner that contains a reporter 
group. Suitable second binding partners include antibodies that bind to the binding 
partner/polypeptide complex. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding 
partner after incubation of the binding partner with the sample. The extent to which 
components of the sample inhibit the binding of the labeled polypeptide to the binding 
partner is indicative of the reactivity of the sample with the immobilized binding partner. 

The solid support may be any material known to those of ordinary skill in the 
art to which the antigen may be attached. For example, the solid support may be a test well in 
a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support 
may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene 
or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, 
such as those disclosed, for example, in U.S. Patent No. 5,359,681. The binding agent may 
be immobilized on the solid support using a variety of techniques known to those of skill in 
the art, which are amply described in the patent and scientific literature. In the context of the 
present invention, the term "immobilization" refers to both noncovalent association, such as 
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adsorption, and covalent attachment (which may be a direct linkage between the antigen and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 
Inunobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In 
such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer, 
with the solid support for a suitable amount of time. The contact time varies with 
temperature, but is typically between about 1 hour and about 1 day. In general, contacting a 
well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of 
binding agent ranging from about 10 ng to about 10 ^ig, and preferably about 100 ng to about 
1 |ig, is sufficient to immobilize an adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with both the 
support and a functional group, such as a hydroxyl or amino group, on the binding agent. For 
example, the binding agent may be covalently attached to supports having an appropriate 
polymer coating using benzoquinone or by condensation of an aldehyde group on the support 
with an amine and an active hydrogen on the binding partner {see, e.g., Pierce 
Immunotechnology Catalog and Handbook, 1991, at A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. This 
assay may be performed by first contacting an antibody that has been immobilized on a solid 
support, commonly the well of a microtiter plate, with the sample, such that polypeptides 
within the sample are allowed to bind to the immobilized antibody. Unbound sample is dien 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains bound to the solid support is then 
determined using a method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically blocked. 
Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum 
albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The immobilized antibody is 
then incubated with the sample, and polypeptide is allowed to bind to the antibody. The 
sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior 



wo 99/33869 



21 



PCT/US98/27416 



to incubation. In general, an appropriate contact time {i.e., incubation time) is that period of 
time that is sufficient to detect the presence of polypeptide within a sample obtained from an 
individual with breast cancer. Preferably, the contact time is sufficient to achieve a level of 
binding that is at least about 95% of that achieved at equilibrium between bound and unbound 
polypeptide. Those of ordinary skill in the art will recognize that the time necessary to 
achieve equilibrium may be readily determined by assaying the level of binding that occurs 
over a period of time. At room temperature, an incubation time of about 30 minutes is 
generally sufficient. 

Unbound sample may then be removed by washing the solid support with an 
appropriate buffer, such as PBS containing 0.1% Tween 20'^^. The second antibody, which 
contains a reporter group, may then be added to the solid support. Preferred reporter groups 
include enzymes (such as horseradish peroxidase), substrates, cofactors, inhibitors, dyes, 
radionuclides, luminescent groups, fluorescent groups and biotin. The conjugation of 
antibody to reporter group may be achieved using standard methods known to those of 
ordinary skill in the art. 

The second antibody is then incubated with the inunobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An 
appropriate amount of time may generally be determined by assaying the level of binding that 
occurs over a period of time. Unbound second antibody is then removed and bound second 
antibody is detected using the reporter group. The method employed for detecting the 
reporter group depends upon the nature of the reporter group. For radioactive groups, 
scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic 
methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may 
be detected using avidin, coupled to a different reporter group (commonly a radioactive or 
fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the 
addition of substrate (generally for a specific period of time), followed by spectroscopic or 
other analysis of the reaction products. 

To determine the presence or absence of breast cancer, the signal detected 
from the reporter group that remains bound to the solid support is generally compared to a 
signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the 
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cut-off value is the average mean signal obtained when the immobilized antibody is incubated 
with samples from patients without breast cancer. In general, a sample generating a signal 
that is three standard deviations above the predetermined cut-off value is considered positive 
for breast cancer. In an alternate preferred embodiment, the cut-off value is determined using 
a Receiver Operator Curve, according to the method of Sackett et al., Clinical Epidemiology: 
A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this 
embodiment, the cut-off value may be determined from a plot of pairs of true positive rates 
(i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible 
cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to 
the upper left-hand comer (i.e., the value that encloses the largest area) is the most accurate 
cut-off value, and a sample generating a signal that is higher than the cut-off value 
determined by this method may be considered positive. Alternatively, the cut-off value may 
be shifted to the left along the plot, to minimize the false positive rate, or to the right, to 
minimize the false negative rate. In general, a sample generating a signal that is higher than 
the cut-off value determined by this method is considered positive for breast cancer. 

In a related embodiment, the assay is performed in a flow-through or strip test 
format, wherein the antibody is immobilized on a membrane, such as nitrocellulose. In the 
flow-through test, polypeptides within the sample bind to the immobilized antibody as the 
sample passes through the membrane. A second, labeled antibody then binds to the antibody- 
polypeptide complex as a solution containing the second antibody flows through the 
membrane. The detection of bound second antibody may then be performed as described 
above. In the strip test format, one end of the membrane to which antibody is bound is 
inmiersed in a solution containing the sample. The sample migrates along the membrane 
through a region containing second antibody and to the area of immobilized antibody. 
Concentration of second antibody at the area of immobilized antibody indicates the presence 
of breast cancer. Typically, the concentration of second antibody at that site generates a 
pattern, such as a line, that can be read visually. The absence of such a pattern indicates a 
negative result. In general, the amount of antibody immobilized on the membrane is selected 
to generate a visually discernible pattern when the biological sample contains a level of 
polypeptide that would be sufficient to generate a positive signal in the two-antibody 
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sandwich assay, in the format discussed above. Preferably, the amount of antibody 
immobihzed on the membrane ranges from about 25 ng to about 1 |ig, and more preferably 
from about 50 ng to about 500 ng. Such tests can typically be performed with a very small 
amount of biological sample. 

Of course, numerous other assay protocols exist that are suitable for use with 
the antigens or antibodies of the present invention. The above descriptions are intended to be 
exemplary only. 

In another embodiment, the above polypeptides may be used as markers for 
the progression of breast cancer. In this embodiment, assays as described above for the 
diagnosis of breast csmcer may be performed over time, and the change in the level of reactive 
polypeptide(s) evaluated. For example, the assays may be performed every 24-72 hours for a 
period of 6 months to 1 year, and thereafter performed as needed. In general, breast cancer is 
progressing in those patients in whom the level of polypeptide detected by the binding agent 
increases over time. In contrast, breast cancer is not progressing when the level of reactive 
polypeptide either remains constant or decreases with time. 

Antibodies for use in the above methods may be prepared by any of a variety 
of techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one such 
technique, an immunogen comprising the antigenic polypeptide is initially injected into any 
of a wide variety of manmials {e.g., mice, rats, rabbits, sheep and goats). In this step, the 
polypeptides of this invention may serve as the inmiunogen without modification. 
Alternatively, particularly for relatively short polypeptides, a superior immune response may 
be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or 
keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster immunizations, and 
the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then 
be purified from such antisera by, for example, affinity chromatography using the polypeptide 
coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest may 
be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol 
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6:5\ 1-519, 1976, and improvements thereto. Briefly, these methods involve the preparation 
of inmiortal cell lines capable of producing antibodies having the desired specificity (/.e., 
reactivity with the polypeptide of interest). Such cell lines may be produced, for example, 
from spleen cells obtained from an animal immunized as described above. The spleen cells 
are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably 
one that is syngeneic with the immunized animal. A variety of fusion techniques may be 
employed. For example, the spleen cells and myeloma cells may be combined with a 
nonionic detergent for a few minutes and then plated at low density on a selective medium 
that supports the growth of hybrid cells, but not myeloma cells. A preferred selection 
technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient 
time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are 
selected and tested for binding activity against the polypeptide. Hybridomas having high 
reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the yield, 
such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate 
host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or 
the blood. Contaminants may be removed from the antibodies by conventional techniques, 
such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this 
invention may be used in the purification process in, for example, an affinity chromatography 
step. 

Monoclonal antibodies of the present invention may also be used as 
therapeutic reagents, to diminish or eliminate breast tumors. The antibodies may be used on 
their own (for instance, to inhibit metastases) or coupled to one or more therapeutic agents. 
Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, 
and derivatives thereof Preferred radionuclides include '^Y, '^X '^X ''X "'Re, *''Re, '^^At, 
and ^"Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins 
include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella 
toxin, and pokeweed antiviral protein, 
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A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable 
monoclonal antibody either directly or indirectly (e.g., via a linker group), A direct reaction 
between an agent and an antibody is possible when each possesses a substituent capable of 
reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl 
group, on one may be capable of reacting with a carbonyl-containing group, such as an 
anhydride or an acid halide, or with an alkyl group containing a good leaving group {e.g., a 
halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an antibody 
from an agent in order to avoid interference with binding capabilities. A linker group can 
also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and 
thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate 
the use of agents, or functional groups on agents, which otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional or 
polyfunctional reagents, both homo- and hetero-functional (such as those described in the 
catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group. 
Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl 
groups or oxidized carbohydrate residues. There are numerous references describing such 
methodology, eg., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody portion 
of the immunoconjugates of the present invention, it may be desirable to use a linker group 
which is cleavable during or upon internalization into a cell. A number of different cleavable 
linker groups have been described. The mechanisms for the intracellular release of an agent 
from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent 
No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S, Patent 
No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side chains {e.g., U.S. 
Patent No. 4,638,045, to Kohn et al.), by serum complement-mediated hydrolysis (e.g., U.S. 
Patent No. 4,671,958, to Rodwell etal.), and acid-catalyzed hydrolysis (e.g., U.S. Patent 
No. 4,569,789, to Blattler et al.). 
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It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent may 
be prepared in a variety of ways. For example, more than one agent may be coupled directly 
to an antibody molecule, or linkers which provide multiple sites for attachment can be used. 
Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent bonding 
either directly or via a linker group. Suitable carriers include proteins such as albumins {e.g., 
U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides such as aminodextran 
{e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may also bear an agent by 
noncovalent bonding or by encapsulation, such as within a liposome vesicle {e.g., U.S. Patent 
Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide agents include 
radiohalogenated small molecules and chelating compounds. For example, U.S. Patent No. 
4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A 
radionuclide chelate may be formed from chelating compounds that include those containing 
nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, 
radionuclide. For example, U.S. Patent No. 4,673,562, to Davison etal. discloses 
representative chelating compounds and their synthesis. 

A variety of routes of administration for the antibodies and immunoconjugates 
may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or 
in the bed of a resected tumor. It will be evident that the precise dose of the 
antibody/immunoconjugate will vary depending upon the antibody used, the antigen density 
on the tumor, and the rate of clearance of the antibody. 

Diagnostic reagents of the present invention may also comprise polynucleotide 
sequences encoding one or more of the above polypeptides, or one or more portions thereof. 
For example, at least two oligonucleotide primers may be employed in a polymerase chain 
reaction (PGR) based assay to amplify breast tumor-specific cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for a DNA molecule 
encoding a breast tumor protein of the present invention. The presence of the amplified 
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cDNA is then detected using techniques well known in the art, such as gel electrophoresis. 
Similarly, oligonucleotide probes specific for a DNA molecule encoding a breast tumor 
protein of the present invention may be used in a hybridization assay to detect the presence of 
an inventive polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a DNA 
molecule" means an oligonucleotide sequence that has at least about 60%, preferably at least 
about 75% and more preferably at least about 90%, identity to the DNA molecule in question. 
Oligonucleotide primers and/or probes which may be usefully employed in the inventive 
diagnostic methods preferably have at least about 10-40 nucleotides. In a preferred 
embodiment, the oligonucleotide primers comprise at least about 1 0 contiguous nucleotides 
of a DNA molecule having a partial sequence selected from SEQ ID NOS: 1- 94. Preferably, 
oligonucleotide probes for use in the inventive diagnostic methods comprise at least about 1 5 
contiguous oligonucleotides of a DNA molecule having a partial sequence provided in SEQ 
ID NOS: 1- 94. Techniques for both PGR based assays and hybridization assays are well 
known in the art (see, for example, Mullis et al. Ibid; Ehrlich, Ibid), Primers or probes may 
thus be used to detect breast tumor-specific sequences in biological samples, including blood, 
urine and/or breast tumor tissue. 

The following Examples are offered by way of illustration and not by way of 

limitation. 

EXAMPLES 
Example 1 

ISOLATION AND CHARACTERIZATION OF BREAST 
TUMOR POLYPEPTIDES 

This Example describes the isolation of breast tumor polypeptides from a 
breast tumor cDNA library. 

A human breast tumor cDNA expression library was constructed from a pool 
of breast tumor poly A"" RNA from three patients using a Superscript Plasmid System for 
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cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, Gaithersburg, MD 20897) 
following the manufacturer's protocol. Specifically, breast tumor tissues were homogenized 
with polytron (Kinematica, Switzerland) and total RNA was extracted using Trizol reagent 
(BRL Life Technologies) as directed by the manufacturer. The poly RNA was then 
purified using a Qiagen oligotex spin column mRNA purification kit (Qiagen, Santa Clarita, 
CA 91355) according to the manufacturer's protocol. First-strand cDNA was synthesized 
using the NotI/01igo-dT18 primer. Double-stranded cDNA was synthesized, ligated with 
EcoRl/BstX I adaptors (Invitrogen, Carlsbad, CA) and digested with Notl. Following size 
firactionation with Chroma Spin-1000 columns (Clontech, Palo Alto, CA 94303), the cDNA 
was ligated into the EcoRI/NotI site of pCDNA3.1 (Invitrogen, Carlsbad, CA) and 
transformed into ElectroMax E. coli DHIOB cells (BRL Life Technologies) by 
electroporation. 

Using the same procedure, a normal human breast cDNA expression library 
was prepared from a pool of four normal breast tissue specimens. The cDNA libraries were 
characterized by determining the number of independent colonies, the percentage of clones 
that carried insert, the average insert size and by sequence analysis. The breast tumor library 
contained 1.14 x 10^ independent colonies, with more than 90% of clones having a visible 
insert and the average insert size being 936 base pairs. The normal breast cDNA library 
contained 6x10^ independent colonies, with 83% of clones having inserts and the average 
insert size being 1015 base pairs. Sequencing analysis showed both libraries to contain good 
complex cDNA clones that were synthesized firom mRNA, with minimal rRNA and 
mitochondrial DNA contamination sequencing. 

cDNA library subtraction was performed using the above breast tumor and 
normal breast cDNA libraries, as described by Hara et al {Blood, S4:189-199, 1994) with 
some modifications. Specifically, a breast tumor-specific subtracted cDNA library was 
generated as follows. Normal breast cDNA library (70 |ig) was digested with EcoRI, Notl, 
and Sful, followed by a fiUing-in reaction with DNA polymerase Klenow fragment. After 
phenol-chloroform extraction and ethanol precipitation, the DNA was dissolved in 1 00 ^l of 
HjO, heat-denatured and mixed with 100 ^il (100 |ig) of Photoprobe biotin (Vector 
Laboratories, Burlingame, CA), the resulting mixture was irradiated with a 270 W sunlamp 
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on ice for 20 minutes. Additional Photoprobe biotin (50 ^1) was added and the biotinylation 
reaction was repeated. After extraction with butanol five times, the DNA was ethanol- 
precipitated and dissolved in 23 |il H2O to form the driver DNA. 

To form the tracer DNA, 10 fig breast tumor cDNA library was digested with 
BamHI and Xhol, phenol chloroform extracted and passed through Chroma spin-400 columns 
(Clontech). Following ethanol precipitation, the tracer DNA was dissolved in 5 \i\ H2O. 
Tracer DNA was mixed with 15 |il driver DNA and 20 |al of 2 x hybridization buffer (1 .5 M 
NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% sodium dodecyl sulfate), overlaid with 
mineral oil, and heat-denatured completely. The sample was immediately transferred into a 
68 °C water bath and incubated for 20 hours (long hybridization [LH]). The reaction mixture 
was then subjected to a streptavidin treatment followed by phenol/chloroform extraction. 
This process was repeated three more times. Subtracted DNA was precipitated, dissolved in 
12 [i\ H2O, mixed with 8 |il driver DNA and 20 jal of 2 x hybridization buffer, and subjected 
to a hybridization at 68 °C for 2 hours (short hybridization [SH]). After removal of 
biotinylated double-stranded DNA, subtracted cDNA was ligated into BamHI/XhoI site of 
chloramphenicol resistant pBCSK^ (Stratagene, La Jolla, CA 92037) and transformed into 
ElectroMax E, coli DHIOB cells by electroporation to generate a breast tumor specific 
subtracted cDNA library. 

To analyze the subtracted cDNA library, plasmid DNA was prepared from 100 
independent clones, randomly picked from the subtracted breast tumor specific library and 
characterized by DNA sequencing with a Perkin Elmer/Applied Biosystems Division 
Automated Sequencer Model 373A (Foster City, CA). Thirty-eight distinct cDNA clones 
were found in the subtracted breast tumor-specific cDNA library. The determined 3' cDNA 
sequences for 14 of these clones are provided in SEQ ID NO: 1-14, with the corresponding 
5' cDNA sequences being provided in SEQ ID NO: 15-28, respectively. The determined one 
strand (5' or 3') cDNA sequences for the remaining clones are provided in SEQ ID NO: 29- 
52. Comparison of these cDNA sequences v^th known sequences in the gene bank using the 
EMBL and GenBank databases (Release 97) revealed no significant homologies to the 
sequences provided in SEQ ID NO: 3, 10, 17, 24 and 45-52. The sequences provided in SEQ 
ID NO: 1, 2, 4-9, 1 1-16, 1 8-23, 25-41 , 43 and 44 were found to show at least some degree of 
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homology to known human genes. The sequence of SEQ ID NO: 42 was found to show 
some homology to a known yeast gene. 

Data was analyzed using Synteni provided GEMTOOLS Software. Twenty 
one distinct cDNA clones were found to be over-expressed in breast tumor and expressed at 
low levels in all normal tissues tested. The determined partial cDNA sequences for these 
clones are provided in SEQ ID NO: 53- 73. Comparison of the sequences of SEQ ID NO: 
53, 54, and 68-71 with those in the gene bank as described above, revealed some homology to 
previously identified human genes. No significant homologies were found to the sequences 
of SEQ ID NO: 55-67, 72 (referred to as JJ 9434,7117), and 73 (referred to as B535S). 

In a further experiment, cDNA fragments analyzed by DNA microarray were 
obtained from two subtraction libraries derived by conventional subtraction, as described 
above. In one instance the tester was derived from primary breast tumors. In the second 
instance, a metastatic breast tumor was employed as the tester. Drivers consisted of normal 
breast. 

cDNA fragments from these two libraries were submitted as templates for 
DNA microarray analysis. DNA chips were analyzed by hybridizing with fluorescent probes 
derived from mRNA from both tumor and normal tissues. Analysis of the data was 
accomplished by creating three groups from the sets of probes. The composition of these 
probe groups, referred to as Breast Tumor/mets, Normal non-breast tissues, and Metastatic 
breast tumors. Two comparisons were performed using the modified Gemtools analysis. The 
first comparison was to identify templates with elevated expression in breast tumors. The 
second was to identify templates not recovered in the first comparison that yielded elevated 
expression in metastatic breast tumors. An arbitrary level of increased expression (mean of 
tumor expression versus the mean of normal tissue expression) was set at approximately 2.2. 

In the first round of comparison to identify overexpression in breast tumors, 
two novel gene sequences were identified, hereinafter referred to as B534S and B538S (SEQ 
ID NO: 89 and 90), and six sequences that showed some degree of homology to previously 
identified genes (SEQ ID NO: 74-79). Additionally, in a second comparison to identify 
elevated expression in metastatic breast tumors, five novel sequences were identified, 
hereinafter referred to as B535S (overexpressed in this analysis as well as what was described 
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above), B542S, B543S, P501S, and B541S (SEQ ID NO: 73, and 91-94), as well as nine gene 
sequences that showed some homology to known genes (SEQ ID NO: 80-88). Clone B534S 
and B538S (SEQ ID NO: 89 and 90) were shown to be overexpressed in both breast tumors 
and metastatic breast tumors. 

Example 2 

GENERATION OF HUMAN CD8+ CYTOTOXIC T-CELLS THAT RECOGNIZE 
ANTIGEN PRESENTING CELLS EXPRESSING BREAST TUMOR ANTIGENS 

This Example illustrates the generation of T cells that recognize target cells 
expressing the antigen B511S, also known as 1016-F8 (SEQ ID NO: 56). Human CD8+ T 
cells were primed in-vitro to the B511S gene product using dendritic cells infected with a 
recombinant vaccinia virus engineered to express B511S as follows (also see Yee et al.. 
Journal of Immunology (1996) 157 (9):4079-86). Dendritic cells (DC) were generated from 
peripheral blood derived monocytes by differentiation for 5 days in the presence of 50 ^g/ml 
GMCSF and 30 ^g/ml IL-4. DC were harvested, plated in wells of a 24-well plate at a 
density of 2 x 10^ cells/well and infected for 12 hours with B51 IS expressing vaccinia at a 
muhiplicity of infection of 5. DC were then matured overnight by the addition of 3 |ig/ml 
CD40-Ligand and UV irradiated at lOOfiW for 10 minutes. CD8+ T cells were isolated using 
magnetic beads, and priming cultures were initiated in individual wells (typically in 24 wells 
of a 24-well plate) using 7x10^ CD8+ T cells and 1 x 10^ irradiated CD8-depleted PBMC; 
IL-7 at 10 ng/ml was added to cultures at day 1 . Cultures were re-stimulated every 7-10 days 
using autologous primary fibroblasts retrovirally transduced with B511S and the 
costimulatory molecule B7.1. Cultures were supplemented at day 1 with 15 I.U. of IL-2. 
Following 4 such stimulation cycles, CD8+ cultures were tested for their ability to 
specifically recognize autologous fibroblasts transduced with B511S using an interferon-y 
Elispot assay (see Lalvani et al J. Experimental Medicine (1997) 186:859-965). Briefly, T 
cells from individual microcultures were added to 96-well Elispot plates that contained 
autologous fibroblasts transduced to express either B511S or as a negative control antigen 
EGFP, and incubated overnight at 3T C; wells also contained IL-12 at 10 ng/ml. Cultures 
were identified that specifically produced interferon-y only in response to B51 IS transduced 
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fibroblasts; such lines were further expanded and also cloned by limiting dilution on 
autologous B-LCL retrovirally transduced with B51 IS. Lines and clones were identified that 
could specifically recognize autologous B-LCL transduced with B51 IS but not autologous B- 
LCL transduced with the control antigens EGFP or HLA-A3. An example demonstrating the 
ability of human CTL cell lines derived from such experiments to specifically recognize and 
lyse B5 1 1 S expressing targets is presented in Figure 1 . 

Example 3 
SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on an Perkin Elmer/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HPTU (0-Benzotriazole- 
N,N,N',N*-tetramethylxironium hexafluorophosphate) activation. A Gly-Cys-Gly sequence 
may be attached to the amino terminus of the peptide to provide a method of conjugation, 
binding to an immobilized surface, or labeling of the peptide. Cleavage of the peptides from 
the solid support may be carried out using the following cleavage mixture: trifluoroacetic 
acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the 
peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be 
dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to 
purification by CI 8 reverse phase HPLC. A gradient of 0%-60% acetonitrile (containing 
0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. Following 
lyophilization of the pure fractions, the peptides may be characterized using electrospray or 
other types of mass spectrometry and by amino acid analysis. 

From the foregoing, it will be appreciated that, although specific embodiments 
of the invention have been described herein for the purposes of illustration, various 
modifications may be made without deviating from the spirit and scope of the invention. 
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CLAIMS 

1. An isolated polypeptide comprising an immunogenic portion of a 
breast protein or a variant of said protein that differs only in conservative substitutions and/or 
modifications, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of: (a) 
nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

2. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 1 . 

3. An isolated polynucleotide molecule comprising a sequence provided 
in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94. 

4. An expression vector comprising a polynucleotide molecule according 
to any one of claims 2 and 3. 

5. A host cell transformed with the expression vector of claim 4. 

6. The host cell of claim 5 wherein the host cell is selected from the group 
consisting of £. coli, yeast and mammalian cell lines. 

7. A pharmaceutical composition comprising the polypeptide of claim 1 
and a physiologically acceptable carrier. 

8. A vaccine comprising the polypeptide of claim 1 and a non-specific 
immune response enhancer. 
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9. The vaccine of claim 8 wherein the non-specific immune response 
enhancer is an adjuvant. 

10. A vaccine comprising a polynucleotide molecule of any one of claims 
2 and 3 and a non-specific immune response enhancer. 

11. The vaccine of claim 10 wherein the non-specific immune response 
enhancer is an adjuvant. 

12. A pharmaceutical composition for the treatment of breast cancer 
comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
comprising an immunogenic portion of a breast protein, wherein said protein comprises an 
amino acid sequence encoded by a polynucleotide molecule comprising a sequence selected 
from the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 1 1- 
16, 18-23, 25-44, 53, 54, 68-71, and 74-88; (b) complements of said nucleotide sequences; 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

13. A vaccine for the treatment of breast cancer comprising a polypeptide 
and a non-specific immime response enhancer, said polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of: 
(a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 1 1-16, 18-23, 25-44, 53, 54, 68- 
71, and 74-88; (b) complements of said nucleotide sequences; and (c) sequences that 
hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

14. The vaccine of claim 13 wherein the non-specific immxme response 
enhancer is an adjuvant. 

15. A vaccine for the treatment of breast cancer comprising a 
polynucleotide molecule and a non-specific immune response enhancer, the polynucleotide 
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molecule comprising a sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

16. The vaccine of claim 15, wherein the non-specific immune response 
enhancer is an adjuvant. 

17. A pharmaceutical composition according to claims 7 or 12, for use in 
the manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

18. A vaccine according to any one of claims 8, 10, 13 or 15, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

19. A fusion protein comprising at least one polypeptide according to 

claim 1 . 

20. A pharmaceutical composition comprising a fusion protein according 
to claim 19 and a physiologically acceptable carrier. 

21. A vaccine comprising a fusion protein according to claim 19 and a 
non-specific immune response enhancer. 

22. The vaccine of claim 21 wherein the non-specific immune response 
enhancer is an adjuvant. 

23. A pharmaceutical composition according to claim 20, for use in 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

24. A vaccine according to claim 2 1 , for use in the manufacture of a 
medicament for inhibiting the development of breast cancer in a patient. 
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25. A method for detecting breast cancer in a patient, comprising: 

(a) contacting a biological sample from a patient with a binding agent 
which is capable of binding to a polypeptide, the polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby detecting breast cancer in the patient. 

26. The method of claim 25 wherein the binding agent is a 
monoclonal antibody. 

27. The method of claim 26 wherein the binding agent is a 
polyclonal antibody. 

28. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) contacting a biological sample from a patient with a binding agent that 
is capable of binding to a polypeptide, said polypeptide comprising ari immunogenic portion 
of a breast protein, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; 

(b) determining in the sample an amount of a protein or polypeptide that 
binds to the binding agent; 

(c) repeating steps (a) and (b); and 

(d) comparing the amount of polypeptide detected in steps (b) and (c) to 
monitor the progression of breast cancer in the patient. 
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29. A monoclonal antibody that binds to a polypeptide comprising an 
immunogenic portion of a breast protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, wherein said protein comprises an amino 
acid sequence encoded by a polynucleotide molecule comprising a sequence selected from 
the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45- 
52, 55-67, 72, 73, and 89-94: (b) complements of said nucleotide sequences; and (c) 
sequences that hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

30. A monoclonal antibody according to claim 29, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

31. The monoclonal antibody of claim 30 wherein the monoclonal 
antibody is conjugated to a therapeutic agent. 

32. A method for detecting breast cancer in a patient comprising: 

(a) contacting a biological sample from a patient with at least two 
oligonucleotide primers in a polymerase chain reaction, wherein at least one of the 
oligonucleotides is specific for a polynucleotide molecule encoding a polypeptide comprising 
an immunogenic portion of a breast protein, said protein comprising an amino acid sequence 
encoded by a polynucleotide molecule comprising a sequence selected from the group 
consisting of nucleotide sequences recited in SEQ ID NO: 1-94, complements of said 
nucleotide sequences and sequences that hybridize to a sequence of SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers, thereby detecting breast cancer. 

33. The method of claim 32, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94. 
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34. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies of claim 29; and 

(b) a detection reagent. 

35. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a polynucleotide molecule comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
complements of said sequences and sequences that hybridize to a sequence of SEQ ID NO: 1, 
2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, or 74-88 under moderately stringent conditions; 
and 

(b) a detection reagent. 

36. The kit of claims 34 or 35 wherein the monoclonal antibodies are 
immobilized on a solid support. 

37. The kit of claim 36 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

38. The kit of claims 34 or 35 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 

39. The kit of claim 38 wherein the binding agent is selected from the 
group consisting of anti-immunoglobulins. Protein G, Protein A and lectins. 

40. The kit of claim 38 wherein the reporter group is selected from the 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 

41. A diagnostic kit comprising at least two oligonucleotide primers, at 
least one of the oligonucleotide primers being specific for a polynucleotide molecule 
encoding a polypeptide comprising an immunogenic portion of a breast protein, said protein 
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comprising an amino acid sequence encoded by a polynucleotide molecule comprising a 
sequence selected from the group consisting of nucleotide sequences recited in SEQ ID NOS: 
1-94, complements of said nucleotide sequences and sequences that hybridize to a sequence 
of SEQ ID NO: 1-94 under moderately stringent conditions. 

42. A diagnostic kit of claim 41 wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94. 

43. A method for detecting breast cancer in a patient, comprising: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide probe specific 
for a polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of 
a breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions; and 

(c) detecting in the sample a polynucleotide sequence that hybridizes to 
the oligonucleotide probe, thereby detecting breast cancer in the patient. 

44. The method of claim 43 wherein the oligonucleotide probe comprises 
at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a sequence 
selected from the group consisting of SEQ ID NOS: 1-94. 

45. A diagnostic kit comprising an oligonucleotide probe specific for a 
polynucleotide molecule encoding a polypeptide comprismg an inununogenic portion of a 
breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences, and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions. 
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46. The diagnostic kit of claim 45, wherein the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

47. Peripheral blood cells from a patient incubated in the presence of at 
least one polypeptide of claim 1, such that T cells proliferate, for use in the manufacture of a 
medicament for treating breast cancer in a patient. 

48. The blood cells of claim 47 wherein the T cells is repeated one or 

more times. 

49. A composition for the treatment of breast cancer in a patient, 
comprising T cells proliferated in the presence of a polypeptide of claim 1, in combination 
with a pharmaceutically acceptable carrier. 

50. An antigen presenting cells incubated in the presence of at least one 
polypeptide of claim 1, for use in the manufacture of a medicament for treating breast cancer 
in a patient. 

51. The cells of claim 50 wherein the antigen presenting cells are selected 
from the group consisting of dendritic and macrophage cells. 

52. A composition for the treatment of breast cancer in a patient, 
comprising antigen presenting cells incubated in the presence of a polypeptide of claim 1, in 
combination with a pharmaceutically acceptable carrier. 
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SEQUENCE LISTING 

<110> Corixa Corporation 

<120> COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS OF BREAST CANCER AND 
METHODS FOR THEIR USE 

<130> 210121. 446PC 

<140> PCT 

<141> 1998-12-22 

<160> 94 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 402 

<212> DNA 

<213> Homo sapiens 

<400> 1 

tttttttttt tttttaggag aactgaatca aacagatttt attcaacttt ttagatgagg 60 

aaaacaaatn atacgaaatn ngtcataaga aatgctttct tataccacta tctcaaacca 120 

ctttcaatat tttacaaaat gctcacgcag caaatatgaa aagctncaac acttcccttt 180 

gttaacttgc tgcaatnaat gcaactttaa canacataca aatttcttct gtatcttaaa 240 

agttnaatta ctaattttaa tgatnttnct caagatnttt attcatatac ttttaatgac 300 

tcnttgccna tacatacnta ttttctttac ttttttttta cnatnggcca acagctttca 36 0 

ngcagnccnc aaaaatctta ccggttaatt acacggggtt gt 4 02 

<210> 2 

<2il> 424 

<212> DNA 

<213> Homo sapiens 

<400> 2 

tttttttttt ttttttaaag gtacacattt ctttttcatt ctgtttnatg cagcaaataa 60 
ttcgttggca tcttctctgt gatgggcagc ttgctaaaat tanactcagg ccccttagct 120 
ncatttccaa ctnagcccac gctttcaacc nngccnaaca aagaaaatca gttngggtta 180 
aattctttgc tgganacaaa gaactacatt cctttgtaaa tnatgctttg tttgctctgt 240 
gcaaacncag attgaaggga anaagganac ttntggggac ggaaacaact ngnagaagca 300 
gganccgccc agggncattt cctcaccatg cttaatcttg cnctcacttg cngggcacca 360 
ttaaacttgg tgcaaaaggc gcaattggtg nanggaaccc cacaccttcc ttaaaaagca 420 
gggc 424 

<210> 3 

<211> 421 

<212> DNA 

<213> Homo sapiens 

<400> 3 

tttttttttt tttttcccaa tttaaaaaag cctttttcat acttcaatta caccanactt 60 
aatnatttca tgagtaaatc ngacattatt atttnaaaat ttgcatattt aaaatttgna 120 
tcanttactt ccagactgtt tgcanaatga agggaggatc actcaagngc tgatctcnca 180 
ctntctgcag tctnctgtcc tgtgcccggn ctaatggatc gacactanat ggacagntcn 240 
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cagatcttcc gttcttntcc cttccccaat 
tttggggaca tgntaatttt gcnatcctta 
gggtggttaa tgttcgncng gcttnttgac 
c 



ttcncaccnc tccccttctt ncccggatcn 300 
aaccctgccc gccangggtc ccnanctcag 360 
cncctgcgcc ctttnantcc naaccccaag 420 

421 



<210> 4 

<211> 423 

<212> DNA 

<213> Homo sapiens 



<400> 4 

tttttttatt tttttttcta tttntnntat 
tgtgtatgcg tangtacnta tgtntgcata 
aaaatctcaa natngtantt ggttnatggg 
atggacacng tgaaatgtag ccgctnatca 
cctccnaata aaaatnccng gccctactgg 
acatgcanac nagttaaacc tgtgnactgg 
ttcnccccan ggacantcng aattttttta 
tgt 



ttnntgnggt tcctgtgtgt aattagnang 60 
tttaacctgt tncctttcca tttttaaaat 120 
agtaaanaga gactatngat naattttaac 180 
ntttaaaact tcattttgaa ggccttttnc 240 
gttaagcaac attgcatntc taaagaaacc 300 
tcangcaaac cnanntggaa nanaagggnn 36 0 
acaaattacn atnccccccc ngggggagcc 420 

423 



<210> 5 

<211> 355 

<212> DNA 

<213> Homo sapiens 



<400> 5 

acgaccacct natttcgtat ctttcaactc 
tccaggaaga caggtctcaa cttagggatc 
gcaacctggc acttcaagga agtgcaccga 
aggtggccaa ctgatcactg taggagctga 
agtgaccaan acnaccattn aggatcaccc 
ccaaacggct ggccaatggg ggggtttaat 



ttttcgaccg gacctcttat tcggaagcgt 60 
agatcacgtt atcaacgctc tgggatcgct 120 
tnacgtctag accggccaac acagatctag 180 
ctggcaanan tcaaccgggc cccaaccnag 24 0 
acaggcactc ctcgtcctag ggccaaccna 300 
atttggttna aaaattgatt ttaaa 355 



<210> 6 

<211> 423 

<212> DNA 

<213> Homo sapiens 



<400> 6 

tttttttttt tttttggaca ggaagtaaaa 
acattggaag ccctcatgan tgcagggccc 
tgtacttaac cccacagccn tctgggatna 
catcagcatt aaacttggta aanccccact 
naaacttgaa cttggccctg cgcagggcct 
tgcgnaagga cntaatnact tggccnatgt 
caaaggcacc tcgcaagcct ntttggancc 
ttt 



tttattggtn antattaana ggggggcagc 60 
gccacttgtc cagagggcca cnattgggga 120 
gccgcttttc agccaccatn tcttcaaatt 180 
tctttaagat ntgnatcttc tggcggccag 240 
caatcacatg ctccttgttc tgcagcttgg 300 
gaaccctggc cacantgccc tggggctttc 360 
tgnccgcccc ngcacaggga caacatcttg 420 

423 



<210> 7 

<211> 410 

<212> DNA 

<213> Homo sapiens 



<400> 7 

ttcgcactgg ctaaaacaaa ccgccttgca aagttngaaa aatttatcaa tggaccaaat 60 
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aatgctcata tccnacaagt tggtgaccgt 
nanttgttgt acaataatgt tccaatttng 
gtaaatatca ggcagctttt gatggggcta 
gaggtctgct tcgccngtgt anatgggaaa 
catattgttg tacatgcaga tgaatnngaa 
tttttnnaaa agctnatcac catgttggaa 

<210> 8 

<211> 274 

<212> DNA 

<213> Homo sapiens 



tnttatnata aaaaaatgta tnatgctcct 120 
gacnttcggc atctaccctg gttcacctgg 180 
ggaaagctaa cagtactcga acatgggaaa 24 0 
naattccgtc ttgctcngat ttgtggactt 300 
gaacttgtca actactatca ggatcgtggc 360 
gcggcactng gacttgagcg 410 



<400> 8 

tttttttttt tttttaggtc atacatattt 
taatatatgt gtatatatcc acgtgtgtgt 
atctatatct ntngctcaca tatgcatggg 
taatatgttt taaaactcan anaaatcnga 
ttgtngataa tcttgaanaa tnacnaaaac 



tttattataa canatatntg tatatacata 60 
gtgtgtatca aaaacaacan aantttagtg 120 
agataccagt aaaaaataag tnaatctcca 180 
gagaccnaaa gaaaacgttn atcannatga 240 
atat 274 



<210> 9 

<211> 322 

<212> DNA 

<213> Homo sapiens 



<400> 9 

tttttttttt ttttgtgcct tattgcaccg 
ataagagtga taagtgtgaa aatccttgcc 
gtttttcacc attaantgta ataatggctn 
ttaaaaaagt tttcctctnt ccctngttat 
atttngttca gtacttttnc ctgcacctat 
cctnnanata tgatggatna ca 



gcnanaactt ctagcactat attaaactca 60 
ttctctttaa tcttaatgna naggcatctg 120 
tatgtatttt tatnnatggt cttnatggag 180 
ctaanagttt tnatcaaaaa tgggtataat 240 
agatatgatn ctgttatttt ttcttcttng 300 

322 



<210> 10 

<211> 425 

<212> DNA 

<213> Homo sapiens 



<400> 10 

tttttttttt tttttattct gcagccatta 
ggtcacaaaa taagtacaga atatnacaca 
ttctatattt acaatatctc tggaattcca 
cttctcaggc agcagcgcct tctggcaacc 
ggccaaggat caggaaacag aanaatggaa 
actatctnaa ctgctagttt tcaggcttta 
caaccggaag catcctagat ggtacactct 
caggn 



aatgctgaac actagatnct tatttgtgga 60 
cgccctgccc ataaaaagca cagctcccag 120 
ccttcccttc taatttgact aatatttctg 180 
ataagaacca acntgnggac taggtcggtg 240 
gnagcccccn tgacnctatt aanctntnaa 300 
aatcatgtaa natacgtgtc cttnttgctg 360 
ctccaggtgc caggaaaaga tcccaaatng 420 

425 



<210> 11 

<211> 424 

<212> DNA 

<213> Homo sapiens 



<400> 11 

ttttnttant ttttttancc nctnntccnn tntgttgnag ggggtaccaa atttctttat 60 
ttaaaggaat ggtacaaatc aaaaaactta atttaatttt tnggtacaac ttatagaaaa 120 
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ggttaaggaa accccaacat gcatgcactg 
ntggggaaat tagcccaang ctnagctttc 
aaaaaaattt nccgccnagc cnaatccggg 
gtcccccnat tctttaangg cttncacctn 
ccacngatgg gggtcatttt tntcnnttag 
ccan 



ccttggtaac cagggnattc ccccncggct 18 0 
attatcactn tcccccaggg tntgcttttc 240 
cnctcccatc tggcgcaant tggtcacttg 3 00 
ctcattcggg tnatgtgtct caattaaatc 360 
ccagtttgtg nagttccgtt attganaaaa 420 

424 



<210> 12 

<211> 426 

<212> DNA 

<213> Homo sapiens 



<400> 12 

tttttttttt ttttncttaa aagcttttat 
atgttgtctg ctttttccac tagagccctt 
ctaataattc cnaaactggt atcataaata 
tcacactgtg ttngttgctt tttnacatgc 
nacatacctg gttntacaac ctgaggtaan 
ctggctaaga gctnggcnct gttnantant 
tttccnctng tgtccttgct tnagtacccc 
ctnaat 



ctcctgctta cattacccat ctgttcttgc 60 
aacaacttaa tcatggttat tttaagggct 120 
agtctcgttc tnatgcttgt tttctctcta 180 
tttgtaattt ttggctgaaa gctgaaaaat 24 0 
cagccttnta gtgtgaggtt ttatatntta 300 
tgttgtanct ntatatgcca naggctttna 360 
attnttttag gggttcccta naaactctat 420 

426 



<210> 13 

<211> 419 

<212> DNA 

<213> Homo sapiens 



<400> 13 

tttttttttt tttttnagat agactctcac 
aatcaaggct cactgcaacc tctgccttat 
ttaaaaatat ctctncacaa ctaatgtata 
ggctcagccc tcgnaacaca tttccctgtt 
anatntaagc ttttccaggc ccagaaaagc 
acttgccacc ctgtggcagc acagctccac 
tgtaancccc ctgnaagacc cggatcagct 

<210> 14 

<211> 400 

<212> DNA 

<213> Homo sapiens 



tctttcgccc aggctggagt gcagtggcgc 60 
aaagcatttn ctaaaggtac aagctaaatt 120 
acaaaaatta gttctacctc ataaacncnt 180 
ctcaactgat gaacactcca naaacagaac 240 
tcgcgagggg atttgctntg tgtgtgacac 3 00 
acntgctttg ggccgcattt gcaagttctc 360 
gggtngaaat tgcangcnct cttttggca 419 



<400> 14 

aanccattgc caagggtatc cggaggattg 
ccctcaggaa agcaaagagc ttgaaaaatg 
tcanactgct ccaacaagga tntgcanagg 
actgcagtcn tcccccantg gcagaaggat 
gaaggtcgtg gatnacttgg accgagcctc 
aagacaaagc anttcatcga cgccaacccc 
gcggcgcccc cgcccagggc cttaataanc 



tggctgtcac aggtnccgag gcccanaagg 60 
tctctctgtc atggaagccn aagtgaaggc 120 
gagatcgcta accttggaga ggccctggcc 180 
gaattgcggg agactctcan atcccttang 24 0 
nnaagccaat ntccagaaca agtgttggag 300 
naccggcctc tnttctcctg ganattgana 360 
cntgaagctn 4 00 



<210> 15 

<211> 395 

<212> DNA 

<213> Homo sapiens 
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<400> 15 

tgctttgctg cgtccaggaa 
agcgagatta gacntactga 
tgatgaagca attgtgaagc 
tactgaagca ggtatgttcg 
catgaaagcn gtcagaanag 
ctntgtattt actgttggan 
aagttcaaga aaattatgtt 

<210> 16 
<211> 404 
<212> DNA 

<213> Homo sapiens 



gattagatng aanaatacat 
anatccatgc aggtcccatt 
tatcggatgg ctttnatgga 
caattcgtgc tgatcatgat 
tggctnattc tnaaagctgg 
ttttgatgct gcatgacaga 
agttttggcc attat 



attgatttgc caaatgaaca 60 
acaaagcatg gtgaaataga 120 
gcagatctga gaaatgtttg 180 
tttgtagtac aggaagactt 240 
agtctaaatt ggacnacnac 300 
ttttgcttan tgtaaaaatn 360 

395 



<400> 16 

ccaccactaa aatcctggct gagccctacn 
nagggcacac cctttaagtn aggtgacagg 
ttcacctctt gggcttgagt acctggttct 
ctnccatgta cgtgctgcca atngtcttga 
gtaggagctg caggcnagaa ngaagcggtt 
tgcccctgct tctcaaggag gggtgtgggg 
ntctcanggc ttccctnccg gctnancgca 



agtacctgtg cccctccccc aggacgagat 60 
tcacctttaa gtgaggacag tcagctnaat 120 
cgtgccctga ggcgacnctn agccctgcag 180 
tcttctccac gccnctnaac ttgggcttca 24 0 
aacagcgcca ctccatagcc gcagccnggc 300 
ttcctccacc atcgccgccc ttgcaaacac 360 
ngacttaagc atgg 404 



<210> 17 
<211> 360 
<212> DNA 

<213> Homo sapiens 



<400> 17 

ggccagaagc tttccacaaa ccagtgaagg 
gctggcagca gctgctatct ngatngacng 
acctcatacc tnaccgcttc cctttnaatg 
cggggagaac catacttatt cccctnttcc 
ctctncaatt actntctccn ctgctttntt 
cccgcaatta cctggcaaaa ctcgcgaccc 



tggcagcaaa gaaagcctct tagacnagga 60 
cagaaaccaa ccactaattc agcaaacaca 120 
gccttcggtg tgtgcgcaca tgggcacgtg 180 
cggcctacca cctctnctcc cccttctctt 240 
ctnancacta ctgctngtnt cnanagccng 300 
ttcgggcagc gctaaanaat gcacatttac 360 



<210> 18 

<211> 316 

<212> DNA 

<213> Homo sapiens 



<400> 18 

atacatatac acatatatga ttttagatag 

tgtgtgtata tgtatgtgtc tactcatttt 

agccagtttt tcatttgctt aaatnactca 

taatgttnag tagtgagatt ctgttgaagg 

cattcatgtt gttttttaaa agcttatttg 
tcgatnttat gtangt 



agccatatac ctngaagtag tanatttgtt 60 
aaataaactt gtgatagaga tgtaattntg 120 
ccaagtaact aattaagttn tctttactct 180 
tgatattaaa aaccattcta tattaattaa 240 
aaatcnaatt atgattattt ttcataccag 300 

316 



<210> 19 

<211> 350 

<212> DNA 

<213> Homo sapiens 



<400> 19 
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aagggatgca nataatgctg tgtatgagct 
tactattgaa catgctnggg ctcggtcacg 
ccgttttagt agtcgcagac ctcgaaatga 
anatcgtctt atagttgaga atttatcctc 
agttttgagt tnttttgatg tggcttttta 
ttttattaaa agtagttttn aattaatgga 



tgatggaaaa gaactctgta gcgaaagggt 60 
aggtggaaga ggtagaggac gatactctga 120 
tagacgaaat gctccacctg taagaacaga 180 
aagagtcagc tggcaggttt gttganatac 24 0 
aaaaagttat gggttactna tgttatattg 300 
tntgatggaa ttgttgtttt 350 



<210> 20 

<211> 367 

<212> DNA 

<213> Homo sapiens 



<400> 20 

gntnnncnca agatcctnct ntcccccngg 
ntaanatcnn gccgcncccg aagtctcnct 
ncaattntga cctnnggcga anaatggcng 
tagnatctga ccactangac ccnctatcct 
ccaattagtg catgntanag cntcctggcc 
cgcccctacc angncatccn catctactag 
ccccnct 



gcngccccnc cnccngtnat naccggtttn 60 
nntgccgaga tgncccttat ncncnnatgn 120 
nngtgtatca gtntccnctc tgnggnctct 18 0 
ctcaaaccct gtanncngcc ctaatttgtg 24 0 
cagatggcnt ccatatcctg gtncggcttc 300 
agcttatccg ctncntgngg cgcaccggnt 360 

367 



<210> 21 

<211> 366 

<212> DNA 

<213> Homo sapiens 



<400> 21 

cccaacacaa tggtctaagt anaactgtat 
ctacaatggg aaaatccata cataagtcag 
tcctttatct tctgaagaaa gtacacacct 
tccaggcatc agttggatga ttcatcatgg 
tgtcatacga aaacaccagt tctgcccnna 
tgggaatatt cactctagac cagaaacagc 
aatttn 



tgctctgtag tatagttcca cattggcaac 60 
ttacttcctn atgagctttc tccttctgaa 120 
tggtnatgat atctttgaat tgcccttctt 180 
taattatggc attatcatat tcttcatact 240 
gatgagcttg ttctgcagct cttagcacct 300 
tcccggtgct ccctcatttt ctgaggctta 360 

366 



<210> 22 

<211> 315 

<212> DNA 

<213> Homo sapiens 



<400> 22 

acttaatgca atctctggag gataatttgg atcaagaaat aaagaanaaa tgaattagga 60 
gaagaaatna ctgggtnata tttcaatatt ttagaacttt aanaatgttg actatgattt 120 
caatatattt gtnaaaactg agatacangt ttgacctata tctgcatttt gataattaaa 180 
cnaatnnatt ctatttnaat gttgtttcag agtcacagca cagactgaaa ctttttttga 240 
atacctnaat atcacacttn tncttnnaat gatgttgaag acaatgatga catgccttna 300 
gcatataatg tcgac 315 

<210> 23 

<211> 202 

<212> DNA 

<213> Homo sapiens 

<400> 23 

actaatccag tgtggtgnaa ttccattgtg ttgggcaact caggatatta aatttatnat 60 
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ttaaaaattc ccaagagaaa naaactccag 
aaatgttnca nnaaganatg acgctgattc 
acacccaccg nttcatttta tg 

<210> 24 

<211> 365 

<212> DNA 

<213> Homo sapiens 



gccctgattg tttcactggg gaattttacc 120 
tgtnaaatct ttttcagaag atagaggaga 180 

202 



<400> 24 

ggatttcttg cccttttctc cctttttaag 
ctttctgcca tacaaccgct accacatctg 
ggatctcgga accnagtgtt nacttcattt 
tggtctgtct gtatteacca tggggcctgt 
gtgccctcca ttgtggccac atgggagacc 
acggcactgc atctnacccg acntgggatt 
gatca 



tatcaatgta tgaaatccac ctgtaccacc 60 
gctcctagaa cctgttttgc tttcatagat 120 
ttaaacccca ttttagcaga tngtttgctn 18 0 
acacaccacg tgtggttata gtcaaacaca 240 
catnacccna tactgcatcc tgggctgatn 300 
gaacccgggg tgggcagcng aattgaacag 360 

365 



<210> 25 

<211> 359 

<212> DNA 

<213> Homo sapiens 



<400> 25 

gtttcctgct tcaacagtgc ttggacggaa 
gcccatagcc agccctccgt cacctcttca 
ccgccnctcc ngcgccncgc agccaccgcc 
nacaacgcgt ccacctcgca ngttcgccng 
aaccgcccga tcaacctgga gctctncccc 
nttaaccgcc gnttattttg cttnaaaaga 



cccggcgctc gttccccacc ccggccggcc 60 
ccgcaccctc ggactgcccc aaggcccccg 120 
gccnccncca cctctccttn gtcccgccnt 180 
aactaccacc nggactcata ngccgccctc 240 
ccgacnttaa cctttccntg tcttacttac 300 
acttttcccc aatactttct ttcaccnnt 359 



<210> 26 

<211> 400 

<212> DNA 

<213> Homo sapiens 



<400> 26 

agtgaaacag tatatgtgaa aaggagtttg 
ttataatttc caataggata ctcatcagtt 
ggtttctggt ttcagatttg aactctcaag 
acnacnaana aatctnaacn aacngaanac 
ttcagccacg aggaaaacta tcnccctnaa 
aatcacagtg aatcatagcc caagatcagc 
acttacaggg accacttcac agtnngtnga 



tgannagcta cataaaaata ttagatatct 60 
ttgaataana gacatattct agagaaacca 120 
agcttggaag ttatcactcc catcctcacg 180 
caatgacttt tcttagatct gtcaaagaac 24 0 
tactggggac tggaaagaga gggtacagag 300 
ttgcccggag ctnaagctng tacgatnatt 360 
tnaantgccn 4 00 



<210> 27 

<211> 366 

<212> DNA 

<213> Homo sapiens 



<400> 27 

gaatttctta gaaactgaag tttactctgt 
agggcgctng aatcatagca aatattctca 
gaattttaca ttttccagaa aacactcctt 
gctgtagact gggctgcact ggacacctgc 



tccaagatat atcttcactg tcttaatcaa 60 
tctttcaact aactttaagt agttntcctg 120 
tctgtatctg tgaaagaaag tgtgcctcag 180 
gggggactct ggctnagtgn ggacatggtc 24 0 
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agtattgatt ttcctcanac tcagcctgtg 
gcagttnacg tcatcccaca catcttggac 
tatgna 

<210> 28 

<211> 402 

<212> DNA 

<213> Homo sapiens 



tagctntgaa agcatggaac agattacact 300 
tccnagaccc ggggaggtca catagtccgt 360 

366 



<400> 28 

agtgggagcc tcctccttcc ccactcagtt 
ggaagtggcc agctgcagcg cctcctgcag 
agacacatcc ttgccaccac cttcaccgtc 
gctngctttt aagccccgat nggctgcatt 
gccagcctca ttgtccaccg tgaagagcat 
anagcttcaa ggcttcattc agggccttng 
acnagaggct ggtnngggtn actntcaata 



ctttacatcc ccgaggcgca gctgggcnaa 60 
gcagccaacg ttcttgcctg tggcctgtgc 120 
catcangcct gacacctgct gcacccactc 180 
ctgggggact tgacacaggc ncgtgatctt 24 0 
ggcaaaaagt ctgaggggag tgcatcttga 3 00 
ctnaggcgcc nctctccatc tccnggaata 360 
aactgcttcg tc 402 



<210> 29 

<211> 175 

<212> DNA 

<213> Homo sapiens 



<400> 29 

cggacgggca tgaccggtcc ggtcagctgg 
tctcccttct tgggggccga gggcttcctg 
ttcagccgct gcacgttggt ctgcagggac 



gtggccagtt tcagttcttc agcagaactg 60 
gggaagagga tgagtttgga gcggtactcc 120 
tccgtggact tgttccgcct cctcg 175 



<:210> 30 

<211> 360 

<212> DNA 

<213> Homo sapiens 



<400> 30 

ttgtatttct tatgatctct gatgggttct 
gcatgctcca gatttaaatc cagctgaggc 
tggaaggaaa cttcacggac aggaagactg 
aggtctgggg aatcatgtaa agggtaccca 
gttctttcag ggaaccaaac ccagaattcg 
gataaatgcc ttgggacctg gagtgctggg 



tctcgaaaat gccaagtgga agactttgtg 6 0 
tccctttgtt ttcagttcca tgtaacaatc 120 
ctggagaaga gaagcgcgtt agcccatttg 180 
gacctcactt ttagttattt acatcaatga 240 
gtgcaaaagc caaacatctt ggtgggattt 300 
cttgtgcaca ggaagagcac cagccgctga 360 



<210> 31 

<211> 380 

<212> DNA 

<213> Homo sapiens 



<400> 31 

acgctctaag cctgtccacg agctcaatag 
acgctacgcc atggtttatg gatacaatgc 
agggaccnaa tgagactgag ggaagaaaaa 
gattttgtat ccccctgtnn cagcattncn 
ttcctgtata ttctcttgtc tggctgcacc 
tgaaagtgca ctgcagtnag ggtcaangga 
acttctggtg tgatactttc 



ggaagcctgt gatgactaca gactttgcga 60 
tgcctataan cgctacttca ggaagcgccg 120 
aaatctcttt ttttctggag gctggcacct 180 
gaaatacata ggcttatata caatgcttct 240 
ccttnttccc gcccccagat tgataagtaa 300 
gactcancat atgtgattgt tccntnataa 360 

380 
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<210> 32 

<211> 440 

<212> DNA 

<213> Homo sapiens 



<400> 32 

gtgtatggga gcccctgact cctcacgtgc 
ccaccccctg cacctccacc tgccccagcc 
cctcacttct ggggtggatg atgtgacctt 
ccctgaagtc ttacggtcca acatcaggac 
caggggagac cgtntcanta gggatgtgtg 
gtganaagca cgtggcggct tctgggggcc 
cttggagaac ctcagtcccn gtagccccct 
caccctttgg gggttggggt 



ctgatctgtg cccttggtcc caggtcaggc 60 
cctgcctctg ccccaagtgg ggccagctgc 120 
cctnggggga ctgcggaagg gacaagggtt 180 
caagtcccat ggacatgctg acagggtccc 24 0 
cctggctgtg tacgtgggtg tgcagtgcac 300 
atgtttgggg aaggaagtgt gcccnccacc 360 
gccctggcac agcngcatnc acttcaaggg 420 

440 



<210> 33 

<211> 345 

<212> DNA 

<213> Homo sapiens 



<400> 33 

tattttaaca atgtttatta ttcatttatc 
gattatttgg agtgggtccc aacctagggc 
ctcattttgt gacttaggtg ggggcatggt 
gtgcagaagg aggacctagg cacacacata 
aggctggaag acaaaagtct cccaataaag 
tggtctgctg ggaatgttgt tgttggggtg 



cctctataga accaccaccc acaccgagga 60 

ctggactctg aaatctaact ccccacttcc 120 

tcagtcagaa ctggtgtctc ctatcggatc 180 

tggtggccac acccaggagg gttgattggc 24 0 

gcacttttac ctcaaagang gggtgggagt 300 

gggaagantt atttc 34 5 



<210> 34 

<211> 440 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tgtaattttt ttattggaaa acaaatatac 
ccataagcag attttaagtg gctaaacaaa 
tgtttctggt acaggaccag cagtacaaaa 
ccgttttgca atagtgcaac ttttaagtac 
gttacaactc cacacttcaa caacaacatg 
aaaggcataa cccagatgtt ccctcatttg 
agggcttana ttttcccaga gtaagccnca 
ataaggtttt aggacaatga 



aacttggaat ggattttgag gcaaattgtg 60 

gtttaaaaag caagtaacaa taaaagaaaa 120 

aaatagtgta cgagtacctg gataatacac 180 

atattgttga ctgtccatag tccacgcaga 240 

ctgacagttc ctaaagaaaa ctactttaaa 300 

accaactcca tctnagttta gatgtgcaga 360 

tgcaacatgt tacttgatca attttctaaa 420 

440 



<210> 35 

<211> 540 

<212> DNA 

<213> Homo sapiens 



<400> 35 

atagatggaa tttattaagc ttttcacatg 
gtactaacaa aaactctagc aatcaagaat 
ctgtggcttt taaaatttgg ttttcataag 
cttttaaaaa atgctttagg tcactccaag 
ataaaacaat cacaatttaa taaataacaa 
gtataaggga aaaggtggta gtgttganta 



tgatagcaca tagttttaat tgcatccaaa 60 

ggcagcatgt tattttataa caatcaacac 120 

ataatttata ctgaagtaaa tctagccatg 180 

cttggcagtt aacatttggc ataaacaata 240 

atacaacatt gtaggccata atcatataca 300 

agcagttatt agaatagaat accttggcct 360 
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ctatgcaaat atgtctagac actttgattc actcagccct gacattcagt tttcaaagtt 420 
aggaaacagg ttctacagta tcattttaca gtttccaaca cattgaaaac aagtagaaaa 480 
tgatganttg atttttatta atgcattaca tcctcaagan ttatcaccaa cccctcaggt 540 

<210> 36 
<211> 555 
<212> DNA 

<213> Homo sapiens 
<400> 36 

cttcgtgtgc ttgaaaattg gagcctgccc cccggcccat aagcccttgt tgggaaccga 60 

gaagtgtata tggggcccaa nctactggtg ccagaacaca gagacagcag cccantgcaa 120 

tgctgtcgag cattgcaaac gccatgtgtg gaactaggag gaggaatatt ccatcttggc 180 

agaaaccaca gcattggttt ttttctactt gtgtgtctgg gggaatgaac gcacagatct 24 0 

gtttgacttt gttataaaaa tagggctccc ccacctcccc cntttctgtg tnctttattg 300 

tagcantgct gtctgcaagg gagcccctan cccctggcag acananctgc ttcagtgccc 360 

ctttcctctc tgctaaatgg atgttgatgc actggaggtc ttttancctg cccttgcatg 420 

gcncctgctg gaggaagana aaactctgct ggcatgaccc acagtttctt gactggangc 480 

cntcaaccct cttggttgaa gccttgttct gaccctgaca tntgcttggg cnctgggtng 540 
gnctgggctt ctnaa 555 



<210> 37 

<211> 280 

<212> DNA 

<213> Homo sapiens 



<400> 37 

ccaccgacta taagaactat gccctcgtgt 
acgtggattt tgcttggatc ttggcaagaa 
ctctaaaaaa tatcctgact tctaataaca 
aggtgaactg ccccnagctc tcgtaaccag 
ttncttctgc ttcgctttcc cctaccccac 



attcctgtac ctgcatcatc caactttttc 60 
accctaatct ccctccagaa acagtggact 120 
ttgatntcaa gaaaatgacg gtcacagacc 18 0 
gttctacagg gaggctgcac ccactccatg 24 0 
cccccgccat 280 



<210> 38 

<211> 303 

<212> DNA 

<213> Homo sapiens 



<400> 38 

catcgagctg gttgtcttct tgcctgccct 
tatcaaggga aaggcaagac tgggacgtct 
cttcacacag gtgaactcgg aagacaaagg 
gaccaattac aatgacngat acgatnagat 
tcctaagtct gtggctcgta tcgccnagct 
taa 



gtgtcgtaaa atgggggtcc cttactgcat 60 
agtccacagg aagacctgca ccactgtcgc 120 
cgctttggct nagctggtgn aagctatcag 180 
ccgccntcac tggggtagca atgtcctggg 24 0 
cgaanaggcn aangctaaag aacttgccac 300 

303 



<210> 39 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 39 

gactcagcgg ctggtgctct tcctgtgcac 
atcaaatccc accaagatnt ttggcttttg 
aactcattga tgtaaatnac tnaaagtgag 



aagcccagca ctccaggtcc caaggcattt 60 
caccgaattc tgggtttggc tccctnaaag 120 
gtctgggtac cctttacatg attccccaga 180 
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cctcanatgg gctaacacgc ttctcttctc cagcagtctt cctntccgtg aagttacctt 240 
ccagattgtt acatggaact gaanacaaag ggagcctcag ctngatttaa atctggagca 300 

<210> 40 

<211> 318 

<212> DNA 

<213> Homo sapiens 



<400> 40 

cccaacacaa tggctgagga caaatcagtt 
tgggctgttg ggcgaccaag gccttcccgg 
gatggtgaag ctgacggaga agcacaggtc 
catcattggg ggcatgttca cagtggctgg 
acgagccatc cagaaaaaaa ttgatctngg 
tctgtctcct ctttctcc 



ctctgtgacc agacatgaga aggttgccaa 60 
agtcttcgtc ctctatgagc tctcgcccat 120 
cttcacccac ttcctgacag gtgtgtgcgc 180 
actcatcgat tcgctcatct accactcagc 24 0 
gaagacnacg tagtcaccct cggtncttcc 300 

318 



<210> 41 

<211> 302 

<212> DNA 

<213> Homo sapiens 



<400> 41 

acttagatgg ggtccgttca ggggatacca 
cttggcctga atgttcccca tccggacaca 
ccatnaccat ctcggtaacc tactcttact 
ataatnagtc cataatgtaa atgcctggcc 
ccaaacnatt accagacatt cctcttanat 
tc 



gcgttcacat ttttcctttt aagaaagggt 60 
ggctgcatgt ctctgtnagt gtcaaagctg 120 
ccacaatgtc tatnttcact gcagggctct 18 0 
caagacntat ggcctgagtt tatccnaggc 240 
tgaaaacgga tntctttccc ttggcaaaga 300 

302 



<210> 42 

<211> 299 

<212> DNA 

<213> Homo sapiens 



<400> 42 

cttaataagt ttaaggccaa ggcccgttcc 
ggtttggaca tacctcatgt aaatgtggtt 
gattacatcc atcgagtagg tcgaacagct 
tttgtcacac agtatgatgt ggaactcttc 
ctaccaggtt ttccaacaca ggatgatgag 



attcttctag caactgacgt tgccagccga 60 
gtcaactttg acattcctac ccattccaag 120 
agagctgggc gctccggaaa ggctattact 18 0 
cagcgcatag aacacttnat tgggaagaaa 24 0 
gttatgatgc tnacggaacg cgtcgctna 299 



<210> 43 

<211> 305 

<212> DNA 

<213> Homo sapiens 



<400> 43 

ccaacaatgt caagacagcc gtctgtgaca 
ccttcattgg caatagcaca gccntccggg 
ctgccatgtt ccgccggaag gccttcctcc 
tggagttcac cgaggctgag agcaacatga 
gggatgccac cgcagaaana ggaggaggat 
aggca 



tcccacctcg tggcctcaan atggcagtca 60 
agctcttcaa gcgcatctcg gagcagttca 120 
actggtacac aggcgagggc atggacaaga 180 
acgacctcgt ctctnagtat cagcagtacc 240 
ttcggtnagg aggccgaaga aggaggcctg 300 

305 



<210> 44 
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<211> 399 
<212> DNA 
<213> Homo sapiens 



<400> 44 

tttctgtggg ggaaacctga tctcgacnaa 
tggaacagaa cgaaaacnga tnaatctctg 
cagacacagc tccnaattga ttccttcttt 
ttaacgtatt aagagccnga gactaaacag 
agaagcagcn gcccgcgnaa ttngaagcng 
ttctttatta ggccaacgaa aaaccccgaa 
tccngccnna aaaagaaaga agctttcnga 



attagagaat tttgtcagcg gtatttcggc 60 

tttcctgtat taaagcaact cgatncccag 120 

ngattagcac aacagggaga aagaanatgc 180 

agctttgaca tgtatgctta ggaaagagaa 24 0 

tttctgttgc cntgganaaa gaatttgagc 300 

ananaggcnt tacnatacct tngaaaantc 360 

ttcttaacc 399 



<210> 45 

<211> 440 

<212> DNA 

<213> Homo sapiens 



<400> 45 

gcgggagcag aagctaaagc caaagcccaa 
accagtacca ataacagtgc cagtgccagt 
gccagcctga ccgccactct cacatttggg 
agcaccagtg gcagctctgg tgcctgtggt 
ccttggtttc agtggggaca tctggggctt 
taggaaggcc cangttggag aangatgtgn 
ttattccttt ctgtttgctg gangtcaatt 
canatatngt taatcctgcc 



gagagtggca gtgccagcac tggtgccagt 60 
gccagcacca gtggtggctt cagtgctggt 120 
ctcttcgctg gccttggtgg agctggtgcc 18 0 
ttctcctaca agtgagattt taggtatctg 24 0 
anggggcngg gataaggagc tggatgattc 300 
anagtgtgcc aagacactgc ttttggcatt 36 0 
gacccttnna ntttctctta cttgtgtttt 420 

440 



<210> 46 

<211> 472 

<212> DNA 

<213> Homo sapiens 



<400> 46 

gctctgtaat ttcacatttt aaaccttccc 
gtttctctgt tcctcttcac agcaaaaact 
ccactttctc acccccattc tcccctcaat 
tgtggctntt attanagtca ccaaccttat 
cttctcagca gcactcagct ctggtncttg 
cctcataaca atctccttcc cagcctctac 
ctaggctcag atgtagtgta gctcaaccct 
aaaaatgtcc atncntgtcc tgtgagtgat 



ttgacctcac attcctcttc ggccacctct 6 0 
gttcaaaaga gttgttgatt actttcattt 120 
taactctcct tcatccccat gatgccatta 180 
tctccaaaac anaagcaaca aggactttga 240 
aaacaccccc gttacttgct attcctccta 300 
tgctgccttc tctgagttct tcccagggtc 360 
gctacacaaa gnaatctcct gaaagcctgt 420 
ctnccangna naataacaaa tt 472 



<210> 47 

<211> 550 

<212> DNA 

<213> Homo sapiens 



<400> 47 

ccttcctccg cctggccatc cccagcatgc 
aggtcgggag cttcctcagt ggtctgtatg 
gggggccctg tctgggtgca aggcgacagc 
gtggagctgg gcgctcagtc catcgtgtat 
gcaggcttca gtgtggctgc cagtgtccgg 
gaagcaggca cggaagtcct ctaccgtttc 



tcatgctgtg catggagtgg tgggcctatg 60 
aggatggatg acggggactg gtgggaacct 120 
tgtctttctt caccaggcat cctcggcatg 180 
gaactggcca tcattgtgta catggtccct 240 
gtangaaacg ctctgggtgc tggagacatg 300 
cctgctgatt acagtgctct ttgctgtanc 360 
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cttcagtgtc ctgctgttaa gctgtaagga 

agaacatcat taatctggtg gctcaggtgg 

aagctcttgc tgctcaggta cacgccaatt 
gaattctgct 

<210> 48 
<211> 214 
<212> DNA 

<213> Homo sapiens 



tcacntgggg tacattttta ctaccgaccg 420 
ttccaattta tgctgtttcc cacctctttg 480 
ttgaaaagta aacaacgtgc ctcggagtgg 540 

550 



<400> 48 

agaaggacat aaacaagctg aacctgccca 
acaacctcct caacttcaag ctggtcatct 
agtttgtgtt cagttttaag gtgggccagg 
gtgagacnat ggtctatcac cccnacattg 



agacgtgtga tatcagcttc tcagatccag 60 

gtcctgatna gggcttctac nagagtggga 120 

gttacccgca tgatcccccc aaggtgaagt 180 
acct 214 



<210> 49 

<211> 267 

<212> DNA 

<213> Homo sapiens 



<400> 49 

atctgcctaa aatttattca aataatgaaa 
agtttttagg acaactatgc acaaatgtac 
gtngaggaac ttaatccaac cggagctntt 
tggcaaggaa tggagacnga gtttgcaaat 
ctgctnttgt gtctcccang gaaagtt 



atnaatctgt tttaagaaat tcagtctttt 60 
gatggagaat tctttttgga tnaactctag 120 
gtgaaggtca gaanacagga gagggaatct 180 
tgcagctaga gtnaatngtt ntaaatggga 24 0 

267 



<210> 50 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 50 

gactgggtca aagctgcatg aaaccaggcc 
ggagagaacc tgacttctct ttccctctcc 
ttgggatctt ctgagcttgt ttccctgctg 
gtctagaaga ggcagccctt ctttgtcctc 
ggagagacca anagcctctg atttttaatt 



ctggcagcaa cctgggaatg gctggaggtg 60 

ctcctccaac attactggaa ctctgtcctg 120 

ggtgggacag aggacaaagg agaagggagg 180 

tggggtnaat gagcttgacc tanagtagat 24 0 

tccataanat gttcnaagta tatntntacc 300 



<210> 51 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 51 

gggtaaaatc ctgcagcacc cactctggaa 
cccctatttc tagttggtcc aggattaggg 
aagcgctctc caagcacccc cggcctgggg 
gatcaggttn aataaatgga actcttcctg 
gggctctgtt agaggggacc tccaccctnn 



aatactgctc ttaattttcc tgaaggtggc 60 

atgtggggta tagggcattt aaatcctctc 120 

gtnagtttct catcccgcta ctgctgctgg 180 

tctggcctcc aaagcagcct aaaaactgag 24 0 

ggaagtccga ggggctnggg aagggtttct 300 



<210> 52 
<211> 267 
<212> DNA 
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<213> Homo sapiens 



<400> 52 

aaaatcaact tcntgcatta atanacanat 
cacctatcaa ggaacnnact tgattgcctc 
tacctgaata ccnccgcata actctcaacc 
atgctantcc cgaattcttc attatatcng 
gtatgtnccn taactgccga nncaang 



tctanancag gaagtgaana taattttctg 60 
tactnaacan atatatcgag ttnctatact 120 
nanatncntc nccacgacac tcnttcttna 180 
tgatgttcgn cctgntnata tatcagcaag 240 

267 



<210> 53 

<211> 401 

<212> DNA 

<213> Homo sapiens 

<400> 53 

agsctttagc atcatgtaga agcaaactgc 
caagattttg tgttttctag ctgtccagga 
agagcaagtg aaaccatttc cagcctaaac 
gacctctaag gctccataat catcattaaa 
tatatacagg attaaaatca acattaaatc 
ttgagcattt taaatagtac agtaggctgg 
caaatagaaa actaaagaaa ttagataggc 

<210> 54 
<211> 401 
<212> DNA 

<213> Homo sapiens 



acctatggct gagataggtg caatgaccta 60 
aaagccatct tcagtcttgc tgacagtcaa 120 
tacataaaag cagccgaacc aatgattaaa 180 
tatgcccaaa ctcattgtga ctttttattt 240 
atcttattta catggccatc ggtgctgaaa 300 
tatacattag gaaatggact gcactggagg 360 
tggaaatgct t 401 



<400> 54 

cccaacacaa tggataaaaa cacttatagt 
aagctacaga ttgtcatagt tgttttcctg 
cagtttgacc tttgtcttct ataatatttc 
tttgattctt aactaaaatt gttctcttaa 
ggtgtatttt ctttacctcc aaggaaagaa 
gcattgtttt ggtataaggt acatattttg 
gtgcatccaa tttattatag ttttgtaagt 



aaatggggac attcactata atgatctaag 60 
ctttacaaaa ttgctccaga tctggaatgc 120 
ctttttttcc cctctttgaa tctctgtata 180 
atattctgaa tcctggtaat taaaagtttg 240 
ctactagcta caaaaaatat tttggaataa 300 
gttgaagaca ccagactgaa gtaaacagct 360 
aacaatatgt a 401 



<210> 55 

<211> 933 

<212> DNA 

<213> Homo sapiens 



<400> 55 

tttactgctt ggcaaagtac cctgagcatc 
tcctagggga tgggtcttct attacctggg 
tgtgcatcaa ggaatgcctc cgcctctacg 
caaacccatc acctttccag atggacgctc 
tatttgggct cttcaccaca acccctattt 
gagattctcc agggaaaatt ctgaaaaaat 
tggattaagg aactgcattg ggcagcattt 
attaactctg ctccgcttca agctggctcc 
agttgcctca agtccaagaa tggaatccat 
gtccttttcg tataagaatt aakgagacaa 
aatataatac aaaatatatg tatatggttg 
tgactggttt tgacatccat taacagtaat 



agcagagatg ccgagatgaa atcagggaac 60 
aacacctgag ccagatgcct tacaccacga 120 
caccggtagt aaactatccc ggttactcga 180 
cttacctgca ggaataactg tgtttatcaa 240 
ctgggaagac cctcaggtct ttaacccctt 300 
acatccctat gccttcatac cattctcagc 360 
tgccataatt gagtgtaaag tggcagtggc 420 
agaccactca aggccaccca gctgtcgtca 480 
gtgtttgcaa aaaaagtttg ctaattttaa 540 
ttttcctacc aaaggaagaa caaaaggata 600 
tttgacaaat tatataactt aggatacttc 660 
tttaatttct ttgctgtatc tggtgaaacc 720 
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cacaaaaaca cctgaaaaaa ctcaagctga 
ggtaactagt ggtagagtgg ctttcaagca 
cattactttt atctctgcaa atatctgcat 
taataaaaaa tatctgccaa aaaaaaaaaa 

<210> 56 

<211> 480 

<212> DNA 

<213> Homo sapiens 



gttccaatgc gaagggaaat gattggtttg 780 
tagtttgatc aaaactccac tcagtatctg 840 
gatagcttta ttctcagtta tctttcccca 900 
aaa 933 



<400> 56 

ggctttgaag catttttgtc tgtgctccct 
gcagtcctgg tactcttggg agtttccatc 
gctgctccag ctgacacgta tccagctact 
gaaaccactg ctgctgcaac cactgcgacc 
gctcctacca ctgctcgtaa agacattcca 
aatggtagag tgtgtccctg agatggaatc 
attcatgctt cctgtgattt catccaacta 
ctaatcagtt tattttcttt caaataaaaa 



gatcttcagg tcaccaccat gaagttctta 60 
tttctggtct ctgcccagaa tccgacaaca 120 
ggtcctgctg atgatgaagc ccctgatgct 180 
actgctgctc ctaccactgc aaccaccgct 240 
gttttaccca aatgggttgg ggatctcccg 300 
agcttgagtc ttctgcaatt ggtcacaact 360 
cttaccttgc ctacgatatc ccctttatct 420 
ataactatga gcaacaaaaa aaaaaaaaaa 480 



<210> 57 

<211> 798 

<212> DNA 

<213> Homo sapiens 



<400> 57 

agcctacctg gaaagccaac cagtcctcat 
gactaacttt gtgatatggg aagtgaaaat 
agatgaccag agtactctta accccttaga 
gatggtattg ttttcatgag cttctagaaa 
tgttactgcc attcctattt acagtatatt 
catggggctt ttttggttgt cctaaactta 
tgattataat ttttgtgata atttctggcc 
tatatatttt aaatagattt gataggtttt 
aagttatttg gggttgtctg ggattgtgtg 
tttaccttgg tagtttattt gtggatggca 
tcttggattg ttttgcaaat tacagctgaa 
ggctagaata ggaagagaga aaaaatgaaa 
aaaattttta atgttaagaa aaccttaaat 
aaaaaaaaaa aaaaaaaa 



aatggacaag atccaccagc tcctcctgtg 60 
agttaacacc ttgcacgacc aaacgaacga 120 
actgtttttc cttttgtatc tgcaatatgg 180 
tttcacttgc aagtttattt ttgcttcctg 240 
tgagtgaatg attatatttt taaaaagtta 300 
caaacattcc actcattctg tttgtaactg 360 
tgattgaagg aaatttgaga ggtctgcatt 420 
taaattgctt tttttcataa ggtatttata 480 
aaagaaaatt agaaccccgc tgtatttaca 540 
gttttctgta gttttgggga ctgtggtagc 600 
atctgtgtca tggattaaac tggcttatgt 660 
tggttgttta ctaattttat actcccatta 720 
aaacatgatt gatcaatatg gaaaaaaaaa 780 

798 



<210> 58 

<211> 280 

<212> DNA 

<213> Homo sapiens 



<400> 58 

ggggcagctc ctgaccctcc acagccacct 
gaggtcccac tgagcctctc gcctgccccc 
cccctgcctg gtcccccaca aggactccca 
tggaccatgg tcgtgaggaa gggctcatgc 
cagaataaac cgagaaggaa accagaaaaa 



ggtcagccac cagctggggc aacgagggtg 6 0 
gccactcgtc tggtgcttgt tgatccaagt 120 
tccaggcccc ctctgccctg ccccttgtca 180 
cccttattta tgggaaccat ttcattctaa 240 
aaaaaaaaaa 280 



<210> 59 
<211> 382 
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<212> DNA 

<213> Homo sapiens 



<400> 59 

aggcgggagc agaagctaaa gccaaagccc 
gtaccagtac caataacagt gccagtgcca 
gtgccagcct gaccgccact ctcacatttg 
ccagcaccag tggcagctct ggtgcctgtg 
gttaatcctg ccagtctttc tcttcaagcc 
agcactctag gcagccacta tcaatcaatt 
ccattaaaaa aaaaaaaaaa aa 



aagagagtgg cagtgccagc actggtgcca 60 
gtgccagcac cagtggtggc ttcagtgctg 120 
ggctcttcgc tggccttggt ggagctggtg 180 
gtttctccta caagtgagat tttagatatt 240 
agggtgcatc ctcagaaacc tactcaacac 300 
gaagttgaca ctctgcatta aatctatttg 360 

382 



<210> 60 

<211> 602 

<212> DNA 

<213> Homo sapiens 



<400> 60 

tgaagagccg cgcggtggag ctgctgcccg 
ttgtggtgga gaatagtgcc cagcgggtca 
gggtcccatc ctcgtgagta ccgccactcc 
cttctcgacg gctggcagag atccaagaac 
aggcccgcag gaaggaggag gtctataagc 
gagatgtgtc ccggctggcc tacacccagc 
agcagaagga agagatcacc aagatcttgt 
actccctatc tgggaagctg gaccggacgt 
atgccaagaa ggacgatgct gttcggaagg 
actgcagcca gctcatccag accatcgagg 
ac 

<210> 61 

<211> 1368 

<212> DNA 

<213> Homo sapiens 



atgggactgc caaccttgcc aagctgcagc 60 
tccacttggc gggtcagtgg gagaagcacc 120 
gaaagctgca ggattgcaga gagctggaat 180 
tgcaccagag tgtccgggcg gctgctgaag 24 0 
agctgatgtc agagctggag actctgccca 300 
gcatcccgga gatcgtgggc aacatccgga 360 
ctgatacgaa ggagcttcag aaggaaatca 420 
ttgcggtgac tgatgagctt gtgttcaagg 480 
cctataagta tctagctgct ctgcacgaga 540 
acacaggcac catcatgcgg gaggttcgag 600 

602 



<400> 61 

ccagtgagcg cgcgtaatac gactcactat 
agcggccgcc cttttttttt tttttttatt 
tgaaaacagc taaaacttaa ttccaagcat 
gttcacaaaa cacattcaat gtttaaacac 
attgtttgct tctaaaaagc ccatttccct 
gcaatgaagt tattttgtac aatctagtta 
aaattgcaca gacgtatgca attttccaag 
ccaaagattg gttgcccata atagacttaa 
tacgaaaatg taaaaaagga aatgtaagtc 
ggatgctaaa gcaaaataaa tgtaggttct 
gcttctttga tatgccttag ggttacccat 
aatgaaaatt gatcaaatat acaccttgtc 
ccaaaaaaag ggtaggcatg aagaaaaaaa 
kgktctgata tggcagacar gatacaagtc 
ggtaatgggc tgacaaggta ttattgccag 
gtttttggat tatatagcac ctagagtctc 
acgctaaact tccaagggaa aatctttcag 
agttgcattg ctactgtgat tttttgaaaa 
agagagaaag attttagtct gtttagcaga 



agggcgaatt gggtaccggg ccccccctcg 60 
gatcagaatt caggctttat tattgagcaa 120 
gtgtagttaa agtttgcaaa gtgggatatt 180 
tatttatttg aagaacaaaa tatatttaaa 240 
ccaagtctaa actttgtaat ttgatattaa 300 
aacaagcaga atagcactag gcagaataaa 360 
atagcattct ttaaattcag ttttcagctt 420 
acatataatg atggctaaaa aaaataagta 480 
cactctcaat ctcataaaag gtgagagtaa 540 
ttttttctgt ttccgtttat catgcaatct 600 
ttaagttaga ggttgtaatg caatggtggg 660 
atttcatttc aaattgcggg ctggaaactt 720 
aaatcmaatc agaacctctt caggggtttg 780 
ccaccaggag atggagcaat tcaaaataag 840 
catgggacag aatgagcaac aggctgaaaa 900 
tgatgtaggg aatttttgtt agtcaaacat 960 
gtagcctaag cttgcttttc tagagtgatg 1020 
caaactgggt ttgtacaagt gagaaagact 1080 
agccatttta tctgcgtgca catggatcaa 1140 
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tatttctgat cccctatacc ccaggaaggg caaaatccca aagaaatgtg ttagcaaaat 12 00 

tggctgatgc tatcatattg ctatggacat tgatcttgcc caacacaatg gaattccacc 1260 

acactggact agtggatcca ctagttctag agcggccggc caccgcggtg gagctccagc 132 0 

ttttgttccc tttagtgagg gttaattgcg cgcttggcgt aatcatnn 1368 

<210> 62 

<:211> 924 

<212> DNA 

<213> Homo sapiens 

<400> 62 

caaaggnaca ggaacagctt gnaaagtact gncatncctn cctgcaggga ccagcccttt 60 

gcctccaaaa gcaataggaa atttaaaaga tttncactga gaaggggncc acgtttnart 120 

tntnaatgtn tcargnanar tnccttncaa atgncrnctn cactnactnr gnatttgggt 180 

tnccgnrtnc mgnactatnt caggtttgaa aaactggatc tgccacttat cagttatgtg 24 0 

accttaaaga actccgttaa tttctcagag cctcagtttc cttgtctata agttgggagt 300 

aatattaata ctatcatttt tccaaggatt gatgtgaaca ttaatgaggt gaaatgacag 360 

atgtgtatca tggttcctaa taaacatcca aaatatagta cttactattg tcattattat 420 

tacttgtttg aagctaaaga cctcacaata gaatcccatc cagcccacca gacagagytc 480 

tgagttttct agtttggaag agctattaaa taacaacktc tagtgtcaat tctatacttg 54 0 

ttatggtcaa gtaactgggc tcagcatttt acattcattg tctctttaag ttctagcaat 600 

gtgaagcagg aactatgatt atattgacta cataaatgaa gaaattgagg ctcagataca 660 

ttaagtaatt ctcccagggt cacacagcta gaactggcaa agcctgggat tgatccatga 720 

tcttccagca ttgaagaatc ataaatgtaa ataactgcaa ggccttttcc tcagaagagc 780 

tcctggtgct tgcaccaacc cactagcact tgttctctac aggggaacat ctgtgggcct 84 0 

gggaatcact gcacgtcgca agagatgttg cttctgatga attattgttc ctgtcagtgg 900 
tgtgaaggca aaaaaaaaaa aaaa 924 

<210> 63 

<211> 1079 

<212> DNA 

<213> Homo sapiens 

<400> 63 

agtcccaaga actcaataat ctcttatgtt ttcttttgaa gacttatttt aaatattaac 60 
tatttcggtg cctgaatgga aaaatataaa cattagctca gagacaatgg ggtacctgtt 120 
tggaatccag ctggcagcta taagcaccgt tgaaaactct gacaggcttt gtgccctttt 180 
tattaaatgg cctcacatcc tgaatgcagg aatgtgttcg tttaaataaa cattaatctt 240 
taatgttgaa ttctgaaaac acaaccataa atcatagttg gtttttctgt gacaatgatc 300 
tagtacatta tttcctccac agcaaaccta cctttccaga aggtggaaat tgtatttgca 360 
acaatcaggg caaaacccac acttgaaaag cattttacaa tattatatct aagttgcaca 420 
gaagacccca gtgatcacta ggaaatctac cacagtccag tttttctaat ccaagaaggt 480 
caaacttcgg ggaataatgt gtccctcttc tgctgctgct ctgaaaaata ttcgatcaaa 540 
acgaagttta caagcagcag ttattccaag attagagttc atttgtgtat cccatgtata 600 
ctggcaatgt ttaggtttgc ccaaaaactc ccagacatcc acaatgttgt tgggtaaacc 660 
accacatctg gtaacctctc gatcccttag atttgtatct cctgcaaata taactgtagc 720 
tgactctgga gcctcttgca ttttctttaa aaccattttt aactgattca ttcgttccgc 780 
agcatgccct ctggtgctct ccaaatggga tgtcataagg caaagctcat ttcctgacac 840 
attcacatgc acacataaaa ggtttctcat cattttggta cttggaaaag gaataatctc 900 
ttggcttttt aatttcactc ttgatttctt caacattata gctgtgaaat atccttcttc 960 
atgacctgta ataatctcat aattacttga tctcttcttt aggtagctat aatatggggg 1020 
aataacttcc tgtagaaata tcacatctgg gctgtacaaa gctaagtagg aacacaccc 1079 

<210> 64 
<211> 1001 
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<212> DNA 

<213> Homo sapiens 

<400> 64 

gaatgtgcaa cgatcaagtc agggtatctg tggtatccac cactttgagc atttatcgat 60 
tctatatgtc aggaacattt caagttatct gttctagcaa ggaaatataa aatacttata 120 
gttaactatg gcctatctac agtgcaacta aaaactagat tttattcctt tccacctgtg 180 
ggtttgtatt catttaccac cctcttttca ttccctttct cacccacaca ctgtgccggg 240 
cctcaggcat atactattct actgtctgtc tctgtaagga ttatcatttt agcttccaca 300 
tatgagagaa tgcatgcaaa gtttttcttt ccatgtctgg cttatttcac ttaacataat 360 
gacctccgct tccatccatg ttatttatat tacccaatag tgttcataaa tatatataca 420 
cacatatata ccacattgca tttgtccaat tattcattga cggaaactgg ttaatgttat 480 
atcgttgcta ttgtggatag tgctgcaata aacacgcaag tggggatata atttgaagag 54 0 
tttttttgtt gatgttcctc caaattttaa gattgttttg tctatgtttg tgaaaatggc 600 
gttagtattt tcatagagat tgcattgaat ctgtagattg ctttgggtaa gtatggttat 660 
tttgatggta ttaatttttt cattccatga agatgagatg tctttccatt gtttgtgtcc 720 
tctacatttt ctttcatcaa agttttgttg tatttttgaa gtagatgtat ttcaccttat 780 
agatcaagtg tattccctaa atattttatt tttgtagcta ttgtagatga aattgccttc 840 
ttgatttctt tttcacttaa ttcattatta gtgtatggaa atgttatgga tttttatttg 900 
ttggttttta atcaaaaact gtattaaact tagagttttt tgtggagttt ttaagttttt 960 
ctagatataa gatcatgaca tctaccaaaa aaaaaaaaaa a lOOi 

<210> 65 

<211> 575 

<212> DNA 

<213> Homo sapiens 

<400> 65 

acttgatata aaaaggatat ccataatgaa tattttatac tgcatccttt acattagcca 60 
ctaaatacgt tattgcttga tgaagacctt tcacagaatc ctatggattg cagcatttca 120 
cttggctact tcatacccat gccttaaaga ggggcagttt ctcaaaagca gaaacatgcc 180 
gccagttctc aagttttcct cctaactcca tttgaatgta agggcagctg gcccccaatg 24 0 
tggggaggtc cgaacatttt ctgaattccc attttcttgt tcgcggctaa atgacagttt 300 
ctgtcattac ttagattccc gatctttccc aaaggtgttg atttacaaag aggccagcta 360 
atagccagaa atcatgaccc tgaaagagag atgaaatttc aagctgtgag ccaggcagga 420 
gctccagtat ggcaaaggtt cttgagaatc agccatttgg tacaaaaaag atttttaaag 480 
cttttatgtt ataccatgga gccatagaaa ggctatggat tgtttaagaa ctattttaaa 540 
gtgttccaga cccaaaaagg aaaaaaaaaa aaaaa 575 

<210> 66 

<211> 831 

<212> DNA 

<213> Homo sapiens 

<400> 66 

attgggctcc ttctgctaaa cagccacatt gaaatggttt aaaagcaagt cagatcaggt 60 
gatttgtaaa attgtattta tctgtacatg tatgggcttt taattcccac caagaaagag 120 
agaaattatc tttttagtta aaaccaaatt tcacttttca aaatatcttc caacttattt 180 
attggttgtc actcaattgc ctatatatat atatatatat gtgtgtgtgt gtgtgtgcgc 24 0 
gtgagcgcac gtgtgtgtat gcgtgcgcat gtgtgtgtat gtgtattatc agacataggt 300 
ttctaacttt tagatagaag aggagcaaca tctatgccaa atactgtgca ttctacaatg 360 
gtgctaatct cagacctaaa tgatactcca tttaatttaa aaaagagttt taaataatta 420 
tctatgtgcc tgtatttccc ttttgagtgc tgcacaacat gttaacatat tagtgtaaaa 480 
gcagatgaaa caaccacgtg ttctaaagtc tagggattgt gctataatcc ctatttagtt 54 0 
caaaattaac cagaattctt ccatgtgaaa tggaccaaac tcatattatt gttatgtaaa 600 
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tacagagttt taatgcagta tgacatccca caggggaaaa gaatgtctgt agtgggtgac 660 

tgttatcaaa tattttatag aatacaatga acggtgaaca gactggtaac ttgtttgagt 720 

tcccatgaca gatttgagac ttgtcaatag caaatcattt ttgtatttaa atttttgtac 780 

tgatttgaaa aacatcatta aatatcttta aaagtaaaaa aaaaaaaaaa a 831 

<210> 67 
<211> 590 
<212> DNA 
<213> Homo sapiens 

<400> 67 

gtgctctgtg tattttttta ctgcattaga cattgaatag taatttgcgt taagatacgc 60 

ttaaaggctc tttgtgacca tgtttccctt tgtagcaata aaatgttttt tacgaaaact 120 

ttctccctgg attagcagtt taaatgaaac agagttcatc aatgaaatga gtatttaaaa 180 

taaaaatttg ccttaatgta tcagttcagc tcacaagtat tttaagatga ttgagaagac 240 

ttgaattaaa gaaaaaaaaa ttctcaatca tatttttaaa atataagact aaaattgttt 300 

ttaaaacaca tttcaaatag aagtgagttt gaactgacct tatttatact ctttttaagt 360 

ttgttccttt tccctgtgcc tgtgtcaaat cttcaagtct tgctgaaaat acatttgata 420 

caaagttttc tgtagttgtg ttagttcttt tgtcatgtct gtttttggct gaagaaccaa 480 

gaagcagact tttcttttaa aagaattatt tctctttcaa atatttctat cctttttaaa 540 

aaattccttt ttatggctta tatacctaca tatttaaaaa aaaaaaaaaa 590 

<210> 68 

<211> 301 
<212> DNA 
<213> Homo sapiens 

<400> 68 

ttgtgttggg gttccctttt ccggtcggcg 
cgggcctgca ccatgagcgt cccggccttc 
gagcttcgtg cttatctgaa atctaaagga 
ggacttcatg ttgatttagc tcaaattatt 
gataaagatg ttgaaagtgt gatgaacagt 
c 

<210> 69 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<400> 69 

tctatgagca tgccaaggct ctgtgggagg atgaaggagt gcgtgcctgc tacgaacgct 
ccaacgagta ccagctgatt gactgtgccc agtacttcct ggacaagatc gacgtgatca 
agcaggctga ctatgtgccg agcgatcagg acctgcttcg ctgccgtgtc ctgacttctg 
gaatctttga gaccaagttc caggtggacn aagtcaactt ccacatgntt gacgtgggtg 
gccagcgcga tgaacgccgc aagtggatcc agtgcttcaa cgatgtgact gccatcatct 
t 

<210> 70 
<211> 201 
<212> DNA 
<213> Homo sapiens 

<400> 70 

gcggctcttc ctcgggcagc ggaagcggcg cggcggtcgg agaagtggcc taaaacttcg 60 



tggtcttgcg agtggagtgt 
atcgacatca gtgaagaaga 
gctgagattt cagaagagaa 
gaagcctgtg atgtgtgtct 
ggggnatcct actcttgatc 



ccgctgtgcc 60 
tcaggctgct 120 
ctcggaaggt 180 
gaaggaggat 24 0 
cggaanccna 300 
301 



60 
120 
180 
240 
300 
301 
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gcgttgggtg aaagaaaatg gcccgaacca agcagactgc tcgtaagtcc accggtggga 120 
aagccccccg caaacagctg gccacgaaag ccgccaggaa aagcgctccc tctaccggcg 18 0 
gggtgaagaa gcctcatcgc t 201 



<210> 71 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 71 

gccggggtag tcgccgncgc cgccgccgct gcagccactg caggcaccgc tgccgccgcc 6 0 

tgagtagtgg gcttaggaag gaagaggtca tctcgctcgg agcttcgctc ggaagggtct 120 

ttgttccctg cagccctccc acgggaatga caatggataa aagtgagctg gtacanaaag 180 

ccaaactcgc tgagcaggct gagcgatatg atgatatggc tgcagccatg aaggcagtca 240 

cagaacaggg gcatgaactc ttcaacgaag agagaaatct gctctctggt gcctacaaga 300 

a 301 

<210> 72 

<211> 251 

<212> DNA 

<213> Homo sapiens 

<400> 72 

cttggggggt gttgggggag agactgtggg cctggaaata aaacttgtct cctctaccac 60 

caccctgtac cctagcctgc acctgtccac atctctgcaa agttcagctt ccttccccag 120 

gtctctgtgc actctgtctt ggatgctctg gggagctcat gggtggagga gtctccacca 180 

gagggaggct caggggactg gttgggccag ggatgaatat ttgagggata aaaattgtgt 240 

aagagccaan g 251 



<210> 73 

<211> 913 

<212> DNA 

<213> Homo sapiens 

<400> 73 

tttttttttt tttttcccag gccctctttt tatttacagt gataccaaac catccacttg 60 

caaattcttt ggtctcccat cagctggaat taagtaggta ctgtgtatct ttgagatcat 120 

gtatttgtct ccactttggt ggatacaaga aaggaaggca cgaacagctg aaaaagaagg 180 

gtatcacacc gctccagctg gaatccagca ggaacctctg agcatgccac agctgaacac 240 

ttaaaagagg aaagaaggac agctgctctt catttatttt gaaagcaaat tcatttgaaa 300 

gtgcataaat ggtcatcata agtcaaacgt atcaattaga ccttcaacct aggaaacaaa 360 

attttttttt tctatttaat aatacaccac actgaaatta tttgccaatg aatcccaaag 420 

atttggtaca aatagtacaa ttcgtatttg ctttcctctt tcctttcttc agacaaacac 480 

caaataaaat gcaggtgaaa gagatgaacc acgactagag gctgacttag aaatttatgc 540 

tgactcgatc taaaaaaaat tatgttggtt aatgttaatc tatctaaaat agagcatttt 600 

gggaatgctt ttcaaagaag gtcaagtaac agtcatacag ctagaaaagt ccctgaaaaa 660 

aagaattgtt aagaagtata ataacctttt caaaacccac aatgcagctt agttttcctt 72 0 

tatttatttg tggtcatgaa gactatcccc atttctccat aaaatcctcc ctccatactg 780 

ctgcattatg gcacaaaaga ctctaagtgc caccagacag aaggaccaga gtttctgatt 84 0 

ataaacaatg atgctgggta atgtttaaat gagaacattg gatatggatg gtcagcccaa 900 

cacaatggaa ttc 913 
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<210> 74 
<211> 351 
<212> DNA 

<213> Homo sapiens 
<400> 74 

tgtgcncagg ggatgggtgg gcngtggaga ngatgacaga aaggctggaa ggaanggggg 60 
tgggtttgaa ggccanggcc aaggggncct caggtccgnt tccgnnaagg gacagccttg 120 
aggaagga'gn catggcaagc catagctagg ccaccaatca gattaagaaa nnctgagaaa 180 
nctagctgac catcactgtt ggtgnccagt ttcccaacac aatggaatnc caccacactg 240 
gactagngga nccactagtt ctagagcggc cgccaccgcg gtggaacccc aacttttgcc 300 
cctttagnga gggttaattg cgcgcttggc . ntaatcatgg tcataagctg t 351 

<210> 75 

<211> 251 

<212> DNA 

<213> Homo sapiens 

<400> 75 

tacttgacct tctttgaaaa gcattcccaa aatgctctat tttagataga ttaacattaa 60 

ccaacataat tttttttaga tcgagtcagc ataaatttct aagtcagcct ctagtcgtgg 120 

ttcatctctt tcacctgcat tttatttggt gtttgtctga agaaaggaaa gaggaaagca 180 

aatacgaatt gtactatttg taccaaatct ttgggattca ttggcaaata atttcagtgt 240 

ggtgtattat t 251 

<210> 76 

<211> 251 

<212> DNA 

<213> Homo sapiens 

<400> 76 

tatttaataa tacaccacac tgaaattatt tgccaatgaa tcccaaagat ttggtacaaa 60 

tagtacaatt cgtatttgct ttcctctttc ctttcttcag acaaacacca aataaaatgc 120 

aggtgaaaga gatgaaccac gactagaggc tgacttagaa atttatgctg actcgatcta 180 

aaaaaaatta tgttggttaa tgttaatcta tctaaaatag agcattttgg gaatgctttt 240 
caaagaaggt c 

<210> 77 

<211> 351 

<212> DNA 

<213> Homo sapiens 

<400> 77 

actcaccgtg ctgtgtgctg tgtgcctgct gcctggcagc ctggccctgc cgctgctcag 60 

gaggcgggag gcatgagtga gctacagtgg gaacaggctc aggactatct caagagannn 120 

tatctctatg actcagaaac aaaaaatgcc aacagtttag aagccaaact caaggagatg 180 

caaaaattct ttggcctacc tataactgga atgttaaact cccgcgtcat agaaataatg 240 

cagaagccca gatgtggagt gccagatgtt gcagaatact cactatttcc aaatagccca 300 

aaatggactt ccaaagtggt cacctacagg atcgtatcat atactcgaga c 351 

<210> 78 

<211> 1592 

<212> DNA 

<213> Homo sapiens 
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<400> 78 

gaattccatt gtgttggggc cctgggggcg gaggggaggg gcccaccacg gccttatttc 60 

cgcgagcgcc ggcactgccc gctccgagcc cgtgtctgtc gggtgccgag ccaactttcc 120 

tgcgtccatg cagccccgcc ggcaacggct gcccgctccc tggtccgggc ccaggggccc 180 

gcgccccacc gccccgctgc tcgcgctgct gctgttgctc gccccggtgg cggcgcccgc 240 

ggggtccggg gaccccgacg accctgggca gcctcaggat gctggggtcc cgcgcaggct 300 

cctgcagcag gcggcgcgcg cggcgcttca cttcttcaac ttccggtccg gctcgcccag 360 

cgcgctgcga gtgctggccg aggtgcagga gggccgcgcg tggattaatc caaaagaggg 420 

atgtaaagtt cacgtggtct tcagcacaga gcgctacaac ccagagtctt tacttcagga 480 

aggtgaggga cgtttgggga aatgttctgc tcgagtgttt ttcaagaatc agaaacccag 540 

accaactatc aatgtaactt gtacacggct catcgagaaa aagaaaagac aacaagagga 600 

ttacctgctt tacaagcaaa tgaagcaact gaaaaacccc ttggaaatag tcagcatacc 660 

tgataatcat ggacatattg atccctctct gagactcatc tgggatttgg ctttccttgg 720 

aagctcttac gtgatgtggg aaatgacaac acaggtgtca cactactact tggcacagct 780 

cactagtgtg aggcagtgga aaactaatga tgatacaatt gattttgatt atactgttct 840 

acttcatgaa ttatcaacac aggaaataat tccctgtcgc attcacttgg tctggtaccc 900 

tggcaaacct cttaaagtga agtaccactg tcaagagcta cagacaccag aagaagcctc 960 

cggaactgaa gaaggatcag ctgtagtacc aacagagctt agtaatttct aaaaagaaaa 1020 

aatgatcttt ttccgacttc taaacaagtg actatactag cataaatcat tcttctagta 1080 

aaacagctaa ggtatagaca ttctaataat ttgggaaaac ctatgattac aagtaaaaac 1140 

tcagaaatgc aaagatgttg gttttttgtt tctcagtctg ctttagcttt taactctgga 1200 

agcgcatgca cactgaactc tgctcagtgc taaacagtca ccagcaggtt cctcagggtt 1260 

tcagccctaa aatgtaaaac ctggataatc agtgtatgtt gcaccagaat cagcattttt 1320 

tttttaactg caaaaaatga tggtctcatc tctgaattta tatttctcat tcttttgaac 1380 

atactatagc taatatattt tatgttgcta aattgcttct atctagcatg ttaaacaaag 144 0 

ataatatact ttcgatgaaa gtaaattata ggaaaaaaat taactgtttt aaaaagaact 1500 

tgattatgtt ttatgatttc aggcaagtat tcatttttaa cttgctacct acttttaaat 156 0 

aaatgtttac atttctaaaa aaaaaaaaaa aa 1592 

<210> 79 

<211> 401 

<212> DNA 

<213> Homo sapiens 

<400> 79 

catactgtga attgttcttg actccttttc ttgacattca gttttcanaa tttccatctt 60 

tcttctggaa ctaatgtgct gttctcttga ctgcctgctg ggccagcatc cgattgccag 120 

ccagaaacgt cacactgccc aagatggcca ggtacttcaa ggtctggaac atgttgagct 180 

gagtccagta gacatacatg agtcccagca tagcagcatg tcccaggtga aatataatcg 240 

tgctaggagc aaaagtgaag ttggagacat tggcaccaat ccggatccac tagttctaga 300 

gcggccgcca ccgcggtgga gctccagctt ttgttccctt tagtgagggt taattgcgcg 360 

cttggcgtaa tcatggncat agctgtttcc tgtgtgaaat t 4 01 

<210> 80 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 80 

aaaaatgaaa catctatttt agcagcaaga ggctgtgagg gatggggtag aaaaggcatc 60 

ctgagagagt tctagaccga cccaggtcct gtggcacact atacgggtca ggaggggtgg 120 

aagacaggcc taagctctag gacggtgaat ctcggggcta tttgtggatt tgttagaaac 180 

agacattctt ttggcctttt cctggcactg gtgttgccgg caggtgggca gaagtgagcc 24 0 

accagtcact gttcagtcat tgccaccaca gatcttcagc agaatcttcc ggtaatcccc 300 
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<210> 81 

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 81 

tagccaggtt gctcaagcta attttattct ttcccaacag gatccatttg gaaaatatca 60 

agcctttaga atgtggcagc aagagaaagc ggactacgca ggaacgggga gtttgggaga 120 

agctctcctg gtgttgactt agggatgaag gctccaggct gctgccagaa atggagtcac 180 

cagcagaaga actgntttct ctgataagga tgtcccacca ttttcaagct gttcgttaaa 240 

gttacacagg tccttcttgc agcagtaagt accgttagct cattttccct caagcgggtt 300 

t 301 

<210> 82 

<211> 201 

<212> DNA 

<213> Homo sapiens 



<400> 82 

tcaacagaca aaaaaagttt attgaataca aaactcaaag gcatcaacag tcctgggccc 60 

aagagatcca tggcaggaag tcaagagttc tgcttcaggg tcggtctggg cagccctgga 120 

agaagtcatt gcacatgaca gtgatgagtg ccaggaaaac agcatactcc tggaaagtcc 180 

acctgctggn cactgnttca t 201 



<210> 83 

<211> 251 

<212> DNA 

<213> Homo sapiens 



<400> 83 

gtaaggagca tactgtgccc atttattata gaatgcagtt aaaaaaaata ttttgaggtt 60 

agcctctcca gtttaaaagc acttaacaag aaacacttgg acagcgatgc aatggtctct 120 

cccaaaccgg ctccctctta ccaagtaccg taaacagggt ttgagaacgt tcaatcaatt 180 

tcttgatatg aacaatcaaa gcatttaatg caaacatatt tgcttctcaa anaataaaac 240 

cattttccaa a 251 

<210> 84 

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 84 

agtttataat gttttactat gatttagggc ttttttttca aagaacaaaa attataagca 60 
taaaaactca ggtatcagaa agactcaaaa ggctgttttt cactttgttc agattttgtt 120 
tccaggcatt aagtgtgtca tacagttgtt gccactgctg ttttccaaat gtccgatgtg 180 
tgctatgact gacaactact tttctctggg tctgatcaat tttgcagtan accattttag 240 
ttcttacggc gtcnataaca aatgcttcaa catcatcagc tccaatctga agtcttgctg 
c 



300 
301 



<210> 85 

<211> 201 

<212> DNA 

<213> Homo sapiens 
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<400> 85 

tatttgtgta tgtaacattt attgacatct acccactgca agtatagatg aataagacac 60 

agtcacacca taaaggagtt tatccttaaa aggagtgaaa gacattcaaa aaccaactgc 120 

aataaaaaag ggtgacataa ttgctaaatg gagtggagga acagtgctta tcaattcttg 180 

attgggccac aatgatatac c 201 



<210> 86 
<211> 301 
<212> DNA 

<213> Homo sapiens 



<400> 86 

tttataaaat attttattta cagtagagct ttacaaaaat agtcttaaat taatacaaat 60 

cccttttgca atataactta tatgactatc ttctcaaaaa cgtgacattc gattataaca 120 

cataaactac atttatagtt gttaagtcac cttgtagtat aaatatgttt tcatcttttt 180 

tttgtaataa ggtacatacc aataacaatg aacaatggac aacaaatctt attttgntat 24 0 

tcttccaatg taaaattcat ctctggccaa aacaaaatta accaaagaaa agtaaaacaa 300 

t 301 



<210> 87 
<211> 351 
<212> DNA 

<213> Homo sapiens 



<400> 87 

aaaaaagatt taagatcata aataggtcat tgttgtcaca acacatttca gaatcttaaa 60 

aaaacaaaca ttttggcttt ctaagaaaaa gacttttaaa aaaaatcaat tccctcatca 120 

ctgaaaggac ttgtacattt ttaaacttcc agtctcctaa ggcacagtat ttaatcagaa 180 

tgccaatatt accaccctgc tgtagcanga ataaagaagc aagggattaa cacttaaaaa 240 

aacngccaaa ttcctgaacc aaatcattgg cattttaaaa aagggataaa aaaacnggnt 300 

aaggggggga gcattttaag taaagaangg ccaagggtgg tatgccngga c 351 



<210> 88 
<211> 301 
<212> DNA 

<213> Homo sapiens 



<400> 88 

gttttaggtc tttaccaatt tgattggttt atcaacaggg catgaggttt aaatatatct 60 

ttgaggaaag gtaaagtcaa atttgacttc ataggtcatc ggcgtcctca ctcctgtgca 120 

ttttctggtg gaagcacaca gttaattaac tcaagtgtgg cgntagcgat gctttttcat 180 

ggngtcattt atccacttgg tgaacttgca cacttgaatg naaactcctg ggtcattggg 24 0 

ntggccgcaa gggaaaggtc cccaagacac caaaccttgc agggtacctn tgcacaccaa 300 

c 301 



<210> 89 
<211> 591 
<212> DNA 

<213> Homo sapiens 



<400> 89 

tttttttttt tttttttatt aatcaaatga ttcaaaacaa ccatcattct gtcaatgccc 60 

aagcacccag ctggtcctct ccccacatgt cacactctcc tcagcctctc ccccaaccct 120 

gctctccctc ctcccctgcc ctagcccagg gacagagtct aggaggagcc tggggcagag 180 

ctggaggcag gaagagagca ctggacagac agctatggtt tggattgggg aagagattag 24 0 



wo 99/33869 



25 



PCTAJS98/27416 



gaagtaggtt cttaaagacc cttttttagt accagatatc cagccatatt cccagctcca 300 

ttattcaaat catttcccat agcccagctc ctctctgttc tccccctact accaattctt 360 

tggctcttac acaattttta tccctcaaat attcatccct ggcccaacca gtcccctgag 420 

cctccctctg gtggagactc ctccacccat gagctcccca gagcatccaa gacagagtgc 48 0 

acagagacct ggggaaggaa gctgaacttt gcagagatgt ggacaggtgc aggctagggt 54 0 

acagggtggt ggtagaggag acaagtttta tttccaggcc cacagtctct c 591 

<210> 90 

<211> 1996 

<212> DNA 

<213> Homo sapiens 

<400> 90 

tttttttttt ttttttatca aatgaatact ttattagaga cataacacgt ataaaataaa 60 

tttcttttca tcatggagtt accagatttt aaaaccaacc aacactttct catttttaca 120 

gctaagacat gttaaattct taaatgccat aatttttgtt caactgcttt gtcattcaac 180 

tcacaagtct agaatgtgat taagctacaa atctaagtat tcacagatgt gtcttaggct 240 

tggtttgtaa caatctagaa gcaatctgtt tacaaaagtg ccaccaaagc attttaaaga 300 

aaccaattta atgccaccaa acataagcct gctatacctg ggaaacaaaa aatctcacac 360 

ctaaattcta gcagagtaaa cgattccaac tagaatgtac tgtatatcca tatggcacat 420 

ttatgacttt gtaatatgta attcataata caggtttagg tgtgtggtat ggagctagga 48 0 

aaaccaaagt agtaggatat tatagaaaag atctgatgtt aagtataaag tcatatgcct 54 0 

gatttcctca aaccttttgt ttttcctcat gtcttctgtc tttatatttt tatcacaaac 600 

caagatctaa cagggttctt tctagaggat tattagataa gtaacacttg atcattaagc 660 

acggatcatg ccactcattc atggttgttc tatgttccat gaactctaat agcccaactt 720 

atacatggca ctccaagggg atgcttcagc cagaaagtaa agggctgaaa aagtagaaca 780 

atacaaaagc cctcgtgtgg tgggaactgt ggcctcactc ttacttgtcc ttccattcaa 840 

aacagtttgg cacctttcca tgacgaggat ctctacaggt aggttaaaat acttttctgt 900 

gctattcagc cagaaatagt ttttgtgctg gatatgattt taaaacagat tttgtctgtc 960 

accagtgcaa aaacattaca gatgtctggg ctaatacaaa aacacataag aatctacaac 1020 

tttatattta atactctatt caaatttaac tcaaagtaat gcaaaataat tagaagtaaa 1080 

aacttaattc ttctgagagc tctatttgga aaagcttcac atatccacac acaaatatgg 1140 

gtatattcat gcacagggca aacaactgta ttctgaagca taaataaact caaagtaaga 1200 

catcagtagc tagataccag ttccagtatt ggttaatggt ctctggggat cccattttaa 1260 

gcactctcag atgaggatct tgctcagttg ttagactatc attagtttga ttaagcaact 1320 

gaagtttact tcataaatta ctttttccta tatccaggac tctgcctgag aaattttata 1380 

cattcctcca aaggtaagta ttctccaaag gtaagtattt gactattaac acaaaggcaa 144 0 

tgtgattatt gcataatgac actaaatatt atgtggcttt tctgttaggt ttataagttt 1500 

tcaatgatca gttcaagaaa atgcagatca tatataacta aggttttaca ccagtggttg 1560 

acaaactatg gcccacaggc taaacccagc ctccccttgt ttttataaat aagttttatt 1620 

agacataacc acactcattc atttctgtat tgtgtatagc tgctttcacg ctatactagc 1680 

agaactgaat agttgtgaca gagactgtat ggaccgtgaa gcataaatat ttaccatctg 1740 

gcccattcta aaaaaagtgt gccaattcct ggtttacact aaaatataga gtttagtggg 1800 

aagcctattt gaaatgtgtt ttttttaggg gctgtaatta ccaattaaaa ttaaggttca 1860 

ggtgactcag caaccaaaca aaagggatac taatttttta tgaacaatat atttgtattt 1920 

tatggacata aaaggaaact ttcagaaaga aaaggaggaa aataaagggg gaaagggacc 1980 

caacacaatg gaattc 1996 

<210> 91 

<211> 911 

<212> DNA 

<213> Homo sapiens 



<400> 91 

gccctttttt tttttttttt cttgtttaaa aaaattgttt tcattttaat gatctgagtt 



60 
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agtaacaaac aaacgtacaa aattgtcttt cacatttcca tacattgtgt tatggaccaa 120 

atgaaaacgc tggactacaa atgcaggttt ctttatatcc ttaacttcaa ttattgtcac 180 

ttataaataa aggtgatttg ctaacacatg catttgtgaa cacagatgcc aaaaattata 24 0 

catgtaagtt aatgcacaac caagagtata cactgttcat ttgtgcagtt atgcgtcaaa 300 

tgcgactgac acagaagcag ttatcctggg atatttcact ctatatgaaa agcatcttgg 360 

agaaatagat tgaaatacag tttaaaacaa aaattgtatt ctacaaatac aataaaattt 420 

gcaacttgca catctgaagc aacatttgag aaagctgctt caataaccct gctgttatat 480 

tggttttata ggtatatctc caaagtcatg ggttgggata tagctgcttt aaagaaaata 540 

aatatgtata ttaaaaggaa aatcacactt taaaaatgtg aggaaagctt tgaaaacagt 6 00 

cttaatgcat gagtccatct acatattttc aagttttgga aacagaaaga agtttagaat 660 

tttcaaagta atctgaaaac tttctaagcc attttaaaat aagatttttt tccccatctt 720 

tccaatgttt cctatttgat agtgtaatac agaaatgggc agtttctagt gtcaacttaa 780 

ctgtgctaat tcataagtca ttatacattt atgacttaag agttcaaata agtggaaatt 840 

gggttataat gaaaatgaca agggggcccc ttcagcagcc actcatctga actagtaatc 900 

ccaacacaat g 911 

<210> 92 

<211> 1710 

<212> DNA 

<213> Homo sapiens 

<400> 92 

tttttttttt tttttaactt ttagcagtgt ttatttttgt taaaagaaac caattgaatt 60 

gaaggtcaag acaccttctg attgcacaga ttaaacaaga aagtattact tatttcaact 120 

ttacaaagca tcttattgat ttaaaaagat ccatactatt gataaagttc accatgaaca 180 

tatatgtaat aaggagacta aaatattcat tttacatatc tacaacatgt atttcatatt 240 

tctaatcaac cacaaatcat ataggaaaat atttaggtcc atgaaaaagt ttcaaaacat 300 

taaaaaatta aagttttgaa acaaatcaca tgtgaaagct cattaaataa taacattgac 360 

aaataaatag ttaatcagct ttacttatta gctgctgcca tgcatttctg gcattccatt 420 

ccaagcgagg gtcagcatgc agggtataat ttcatactat gcgaccgtaa agagctacag 480 

ggcttatttt tgaagtgaaa tgtcacaggg tctttcattc tctttcaaag gaagatcact 54 0 

catggctgct aaactgttcc catgaagagt accaaaaaag cacctttctg aaatgttact 600 

gtgaagattc atgacaacat atttttttta acctgttttg aaggagtttt gtttaggaga 660 

9999atgggc cagtagatgg agggtatctg agaagccctt ttctgtttta aaatataatg 720 

attcactgat gtttatagta tcaacagtct tttaagaaca atgaggaatt aaaactacag 780 

gatacgtgga atttaaatgc aaattgcatt catggatata cctacatctt gaaaaacttg 84 0 

aaaaggaaaa actattccca aagaaggtcc tgatacttaa gacagcttgc tgggtttgat 900 

caaagcagaa agcatatact ttcaagtgag aaaacagcag tggcaggctt gagtcttcca 960 

agcaatcaaa tctgtaaagc agatggttac tagtaagtct agttatggga gtctgagttc 1020 

taactcatgc tgtgcttgct ggatttgctg gctcttttcc gctctctgtg atgctggact 1080 

ggcttgtcag gtgacatgct ctcaaagttg tgactggact cgttgtgctg ccgggtgtac 1140 

ctcttgcact tgcaggcagt gactactgtg attttgtagg tgcgtgtgct gccatcttgg 1200 

cactgcagct ggattctctg ggtacgggtt ttgtcattga cacaccgcca ctcctgggag 126 0 

ctcctcctgc tccagtactt tgttccatag cctcctccaa tccagttagg gagcactggc 1320 

aggggcaagc actcgccagc acacaccagc tccttcagag ggctgatgct ggtgcactgg 1380 

ccatcagaga tgtatttggt ggaacgcagt tcccggcaac ccacttgaac ccgagtgttc 1440 

cgatccagtc cagtgttact gaaatgcctg cctccatttc tggcttgatt caacgtgctg 1500 

ttgctgctgg ggtgtgctgg aacaggttta accacatgtg aataaaggat ttctgtggca 1560 

tcatttttaa aagccaaaca gcttttcatt aggatgcatg caaggggaag gagatagaaa 1620 

tgaatggcag gaggaagcat ggtgagtaga ggatttgctt gactgaagag ctggttaatt 1680 

cttttgcctc tgcccaacac aatggaattc 1710 



<210> 93 
<211> 251 
<212> DNA 
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<213> Homo sapiens 
<400> 93 

cccaccctac ccaaatatta gacaccaaca cagaaaagct agcaatggat tcccttctac 60 

tttgttaaat aaataagtta aatatttaaa tgcctgtgtc tctgtgatgg caacagaagg 120 

accaacaggc cacatcctga taaaaggtaa gaggggggtg gatcagcaaa aagacagtgc 180 

tgtgggctga ggggacctgg ttcttgtgtg ttgcccctca agactcttcc cctacaaata 240 

actttcatat g 251 



<210> 94 

<211> 738 

<212> DNA 

<213> Homo sapiens 



<400> 94 

cccttttttt ttttttttcc acttctcagt ttatttctgg gactaaattt gggtcagagc 60 

tgcagagaag ggatgggccc tgagcttgag gatgaaagtg ccccagggag attgagacgc 120 

aacccccgcc ctggacagtt ttggaaattg ttcccagggt tcaactagag agacacggtc 180 

agcccaatgt gggggaagca gaccctgagt ccaggagaca tggggtcagg ggctggagag 24 0 

atgaacattc tcaacatctc tgggaaggaa tgagggtctg aaaggagtgt cagggctgtc 300 

cctgcagcag gtggggatgc cggtgtgctg agtcctggga tgactcagga gttggcctgg 360 

atggtttcct ggatccactt ggtgaacttg cagaggttcg tgtagacacc cggtctgttg 420 

ggccgggcac aagggtaatc tccccaggac acgagtccct gcagggagcc attgcagacc 480 

acaggccccc cagaatcacc ctggcaggag tctctacctg ctttgtcacc ggcgcagaac 540 

atggtgtcat ctatctgtct cgggtaagca tcctcgcacc ttttctgact tagcacgctg 600 

atattcaagc actggaggac cttagggaag tgcacttggg ggctcttggt tgtcccccag 660 

ccagacacca agcactttgt cccagcagag ggacaatgag aggagacgtt gatgggtctg 720 

acatctttag tgggacga 738 
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Compounds and 
methods for the treatment 
and diagnosis of breast 
cancer are provided. The 
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polypeptides containing at 
least a portion of a breast 
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for immunotherapy of 
breast cancer comprising 
such polypeptides, or 
polynucleotide molecules 
encoding such polypeptides, 
are also provided, together 
with polynucleotide 
molecules for preparing the 
inventive polypeptides. 
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An isolated nucleic acid comprising the nucleotide sequence 
of SEQ,ID,N.3. encoding an imnunogenic portion of a breast 
protein or a variant of said protein, a vector and a host 
comprising the nucleic acid, its use for the production of 
the polypeptide or antibodies reactive with the polypeptide, 
its use for the manufacture of a vaccine, its use for the 
detection of breast proteins or nucleic acids. 
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